SYNTHETIC INTERNAL RIBOSOME ENTRY SITES
AND METHODS OF IDENTIFYING SAME 

This application claims the benefit of priority under 35 U.S.C. § 119(e) of
U.S. Serial No. 60/230,956, filed September 7, 2000; U.S. Serial No. 60/230,852,
filed September 7, 2000; U.S. Serial No. 60/207,804, filed May 30, 2000; U.S. Serial
No. 60/186,496, filed March 2, 2000; U.S. Serial No. 60/178,816, filed
January 28, 2000; and U.S. Serial No (attorney docket SCRIP1370), filed
January 12, 2001, each of which is incorporated herein by reference.

This invention was made in part with government support under Grant
No. MCB9982574 awarded by the National Science Foundation. The government has
certain rights in this invention.



BACKGROUND OF THE INVENTION

FIELD OF THE INVENTION 

The present invention relates generally to compositions for modulating the level
of translation of a polypeptide, and more specifically to synthetic internal ribosome entry
site (IRES) nucleotide sequences and to methods of identifying a synthetic IRES.

BACKGROUND INFORMATION 

Molecular biology provides tools for genetically engineering organisms to
express large amounts of useful gene products, including new and useful gene
products. As such, medically useful gene products such as growth factors, hormones,
and immunoregulatory polypeptides, for example, interferons and interleukins, are
now available in sufficient quantities for therapeutic use. In addition, molecular
biology has allowed gene therapy to evolve to a form that promises to provide new
ways for doctors to manage previously untreatable conditions.

Although molecular biology currently provides tools for expressing
recombinant proteins in the cells of bacteria, plants, and animals, including human
cells, methods for manipulating the amount of a protein that is expressed in the cells
are limited. For example, the recombinant protein can be expressed from a particular
gene promoter that has a known level of activity or that is expressed only in one or a
few cell types. However, while the use of a particular promoter may result in the
expression of a particular amount of RNA encoding the protein, the amount of protein
translated from the RNA does not always correlate directly with the amount of RNA
that is expressed. For example, some of the RNA may be degraded in the particular
cell, or the RNA may not be translated efficiently. In other cases, the protein may be
translated in an amount that is toxic, either to the cell expressing the protein or to
another cell that is in proximity to a cell that produces and secretes the protein.

Nucleotide sequences that are involved in regulating the translation of a
polypeptide have been described, and can be utilized to help regulate the amount of a
protein translated from an RNA molecule. Some of these nucleotide sequences are
functional only when positioned at the 5' end of an RNA molecule, and are useful for
regulating the expression of a protein encoded by the nucleotide sequence immediately
downstream of the translation regulatory element. However, such nucleotide
sequences are not useful for regulating the expression of a number of proteins that are
encoded by a single RNA molecule containing a series of open reading frames. For
this purpose, nucleotide sequences that can regulate translation from within a
nucleotide sequence can be useful. However, only a few such sequences, referred to
as internal ribosome binding sites (IRES), have been described, and they generally are
very large, containing more than a hundred, generally several hundred, nucleotides.

The large size of the known IRES sequences limits their usefulness. For
example, many gene therapy vectors such as retrovirus vectors are limited in the size
of a polynucleotide insert that can be contained in and expressed from the vector. As
such, polynucleotides that encode very large proteins cannot be used with certain
vectors. The use of an IRES, which can consist of several hundred nucleotides, only
further limits the size of the encoding polynucleotide that can be included in the
vector, thus further limiting the polynucleotides that can be used in a gene therapy
procedure. Furthermore, the use of a single IRES may not result in sufficient
expression of a desired polypeptide. Thus, a need exists for regulatory elements that
are useful for modulating the level of protein expression. The present invention
satisfies this need and provides additional advantages.

SUMMARY OF THE INVENTION

The present invention relates to an isolated synthetic translational regulatory
element, which includes at least one oligonucleotide consisting of about 6 to 125
ribonucleotides, or a deoxyribonucleotide sequence encoding the oligonucleotide. The
oligonucleotide is characterized, in part, in having translational regulatory activity, for
example, translational enhancing activity, translational inhibitory activity, internal
ribosome entry site (IRES) activity or a combination thereof, and has such translational
regulatory activity in a eukaryotic cell.

A synthetic translational regulatory element of the invention is exemplified
herein by an oligonucleotide encoded by any of SEQ ID NOS: 42 to 46, 49, 50, 52, and
89 to 160, which were identified by screening libraries of randomized oligonucleotides.
In addition, a synthetic translational regulatory element is exemplified herein by an
oligonucleotide that is complementary to an oligonucleotide sequence of a ribosomal
RNA, particularly to an oligonucleotide sequence of an un-base paired region of
ribosomal RNA, for example, an oligonucleotide encoded by any of SEQ ID NOS: 2,
30, 32, 34, 36, 38, 40 and 50.

A synthetic translational regulatory element of the invention can contain two or
more oligonucleotides having translational regulatory activity, wherein the
oligonucleotides are operatively linked to each other, and can be the same or different.
Examples of such synthetic translational regulatory elements include those encoded by a
nucleotide sequence selected from any of SEQ ID NOS: 42 to 46, 49, 50, 52, and 89
to 160. In one embodiment, a synthetic translational regulatory element contains five
operatively linked oligonucleotides having translational regulatory activity. In another
embodiment, a synthetic translational regulatory element contains ten operatively linked
oligonucleotides having translational regulatory activity. In still another embodiment, a
synthetic translational regulatory element contains up to fifty operatively linked
oligonucleotides having translational regulatory activity. Such oligonucleotides in a
synthetic translational regulatory element can be adjacent to each other, or can be
separated from each other by a spacer nucleotide sequence, which can consist of about
1 to 100 ribonucleotides, for example, the synthetic translational regulatory elements
encoded by any of SEQ ID NOS: 53 to 88.
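
The arrangement described above, in which several oligonucleotide modules are joined
either directly or through a spacer, can be illustrated with a minimal sketch. The
sequences below are hypothetical placeholders and are not taken from the sequence
listing; the function name, and the choice to treat the approximate length ranges
(about 6 to 125 and about 1 to 100 ribonucleotides) as hard limits, are assumptions
made only for illustration.

```python
# Hypothetical placeholder sequences; NOT sequences from the SEQ ID listing.
MODULE = "ACGUACGUA"   # 9-nt stand-in for one regulatory oligonucleotide
SPACER = "AAAAAAAAA"   # 9-nt stand-in for a spacer nucleotide sequence

RNA_ALPHABET = set("ACGU")


def build_concatemer(modules, spacer=""):
    """Join RNA modules 5' to 3', placing `spacer` between adjacent modules.

    The length checks mirror the approximate ranges given in the text;
    enforcing them as hard limits is an assumption of this sketch.
    """
    if not modules:
        raise ValueError("at least one module is required")
    for seq in modules:
        if not 6 <= len(seq) <= 125:
            raise ValueError(f"module length {len(seq)} is outside 6-125 nt")
    if len(spacer) > 100:
        raise ValueError("spacer is longer than 100 nt")
    for seq in list(modules) + [spacer]:
        if not set(seq) <= RNA_ALPHABET:
            raise ValueError(f"non-RNA character in {seq!r}")
    return spacer.join(modules)


five_with_spacers = build_concatemer([MODULE] * 5, SPACER)
five_adjacent = build_concatemer([MODULE] * 5)
print(len(five_with_spacers), len(five_adjacent))  # 81 45
```

The same call with ten or up to fifty modules models the other embodiments; mixing
different module sequences in the list models elements whose oligonucleotides differ.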

The present invention also relates to a vector containing a synthetic translational
regulatory element of the invention. In one embodiment, the vector is an expression
vector, which can contain a translation initiation site in addition to the synthetic
translational regulatory element, which can have, for example, translational enhancing
activity. In addition, the vector can contain a translation start codon operatively linked to
the translation initiation site. In another embodiment, the expression vector contains an
expressible polynucleotide, wherein the translational regulatory element is operatively
linked to the expressible polynucleotide, and wherein the translational regulatory
element has translational enhancing activity or IRES activity. The expressible
polynucleotide can include one cistron or can be polycistronic, for example, dicistronic.
Also provided is a host cell containing a synthetic translational regulatory element of the
invention.

The present invention also relates to an isolated translational regulatory element,
which includes at least one 5' untranslated region (5' UTR) of a eukaryotic messenger
RNA (mRNA) or an oligonucleotide portion thereof, or a deoxyribonucleotide sequence
encoding said 5' UTR or oligonucleotide portion thereof. The 5' UTR or oligonucleotide
portion thereof is characterized, in part, in having translational enhancing activity,
internal ribosome entry site (IRES) activity or a combination thereof, and the
translational regulatory activity is effective in a eukaryotic cell.

In one embodiment, the 5' UTR of an isolated translational regulatory element is
encoded by a nucleotide sequence selected from any of SEQ ID NOS: 1, 23 to 29, 161,
162, and 164, and the 5' UTR or oligonucleotide portion thereof has translational
enhancing activity or IRES activity or both. An oligonucleotide portion of a 5' UTR in
such an isolated translational regulatory element is exemplified by an oligonucleotide
encoded by any of nucleotides 1 to 40, 1 to 81, 1 to 120, 41 to 81, 14 to 196, 80 to 120,
80 to 196, 120 to 166, and 120 to 196 of a Gtx homeodomain 5' UTR (SEQ ID NO: 1).
A particular oligonucleotide portion of the Gtx 5' UTR is shown as SEQ ID NO: 2,
which can be linked at its upstream (5') end or downstream (3') end or both ends to a
spacer nucleotide sequence, which can be about 1 to 100 ribonucleotides in length, for
example, an oligonucleotide encoded by SEQ ID NOS: 4, 5, 12, and 20 to 22. An
isolated translational regulatory element also can contain two or more oligonucleotide
portions encoded by SEQ ID NO: 2, which are operatively linked to each other, for
example, five or ten of the oligonucleotides, in operative linkage, and each of the
oligonucleotides (SEQ ID NO: 2) can, but need not, be separated from each other by a
spacer nucleotide sequence. Examples of such isolated translational regulatory elements
are encoded by any of SEQ ID NOS: 6 to 11 and 13 to 15.
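
The oligonucleotide portions above are named by nucleotide positions, which are read
here as 1-based and inclusive (so "nucleotides 1 to 40" is 40 nucleotides long). That
reading, the helper name, and the placeholder sequence are assumptions of the following
sketch, which shows how such ranges map onto a sequence held as a string.

```python
def utr_portion(utr, start, end):
    """Return nucleotides start..end of `utr`, 1-based and inclusive."""
    if not 1 <= start <= end <= len(utr):
        raise ValueError(f"range {start}-{end} is outside 1-{len(utr)}")
    return utr[start - 1:end]


# Hypothetical 196-nt placeholder standing in for a 5' UTR of that length;
# it is NOT the sequence of SEQ ID NO: 1.
placeholder_utr = "ACGU" * 49

# A few of the ranges named in the text.
ranges = [(1, 40), (41, 81), (80, 120), (120, 166), (120, 196)]
portions = {r: utr_portion(placeholder_utr, *r) for r in ranges}
for (start, end), seq in portions.items():
    print(f"{start}-{end}: {len(seq)} nt")
```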

Additional examples of an oligonucleotide portion of a 5' UTR having
translational regulatory activity include oligonucleotide portions of the yeast YAP1
mRNA (SEQ ID NOS: 30 and 32), and oligonucleotide portions of the yeast p150
mRNA, including any of nucleotides 1 to 250, 100 to 508, 160 to 508, 250 to 508, 375
to 508, 429 to 508, 481 to 508, and 250 to 390 of SEQ ID NO: 29; or any of SEQ ID
NOS: 36, 38 and 40. In addition, oligonucleotide sequences of the mouse Rbm3
mRNA, including any of SEQ ID NOS: 165 to 169, 171 to 176, and 182 to 192, have
translational regulatory activity. An isolated translational regulatory element also can
include at least two 5' UTRs or oligonucleotide portions thereof, wherein the at least two
5' UTRs or oligonucleotide portions are operatively linked to each other, and wherein
each of the at least two 5' UTRs or oligonucleotide portions independently is the same or
different from each other.

In another embodiment, an isolated translational regulatory element includes an
oligonucleotide portion of a 5' UTR encoded by any of SEQ ID NOS: 1, 23 to 29, 161,
162, and 164, wherein said oligonucleotide portion thereof has translational inhibitory
activity. Examples of such oligonucleotides having translational inhibitory activity
include those encoded by nucleotides 120 to 196 and 167 to 196 of the Gtx
homeodomain protein mRNA (SEQ ID NO: 1), those encoded by nucleotides 1 to 100
and 100 to 160 of the yeast p150 mRNA (SEQ ID NO: 29), the portion of the yeast p150
mRNA shown as SEQ ID NO: 34, and those encoded by SEQ ID NO: 193 and SEQ ID
NO: 194, which encode portions of the mouse Rbm3 mRNA.

The present invention also relates to a kit, which contains a synthetic
translational regulatory element or an isolated translational regulatory element of the
invention. In particular, a kit of the invention can contain a plurality of synthetic
translational regulatory elements, which are the same as or different from each other,
and which can contain a flanking sequence independently at a 5' end or a 3' end or both
5' and 3' ends, to facilitate operatively linking two or more synthetic translational
regulatory elements in the plurality to each other.

The present invention further relates to a recombinant nucleic acid molecule,
which contains a synthetic translational regulatory element operatively linked to an
expressible polynucleotide. The expressible polynucleotide can comprise a cistron, and
the synthetic translational regulatory element has translational enhancing activity or
IRES activity. The expressible polynucleotide also can comprise, in operative linkage in
a 5' to 3' orientation, a first cistron, a spacer nucleotide sequence, and a second cistron,
and the synthetic translational regulatory element can have IRES activity, and be
operatively linked to the second cistron. The synthetic translational regulatory element
in a recombinant nucleic acid molecule of the invention can include two or more
operatively linked oligonucleotides having translational regulatory activity.

The polypeptide encoded by a recombinant nucleic acid molecule of the
invention can be any polypeptide, for example, a viral or bacterial
polypeptide or an epitope expressed by a virus, bacterium or other pathogenic
organism; or a growth factor, a hormone, or a receptor for a growth factor or a
hormone. The recombinant nucleic acid molecule also can encode two or more
polypeptides.

A particularly useful polypeptide that can be expressed by a recombinant
nucleic acid molecule of the invention, or that can be encoded by an expressible
polynucleotide useful in a composition or method of the invention, is a reporter
molecule, which provides a means to detect, select or isolate, for example, a cell
expressing the reporter polypeptide. Reporter molecules (polypeptides) are well known
in the art and include, for example, fluorescent polypeptides such as green fluorescent
protein, cyan fluorescent protein, red fluorescent protein, or enhanced forms thereof;
an antibiotic resistance polypeptide such as puromycin N-acetyltransferase,
hygromycin B phosphotransferase, neomycin (aminoglycoside) phosphotransferase,
and the Sh ble gene product; a cell surface protein marker such as neural cell
adhesion molecule (N-CAM); an enzyme such as β-galactosidase, chloramphenicol
acetyltransferase, luciferase, and alkaline phosphatase; or a peptide tag such as a
c-myc peptide, a polyhistidine, or the like.

Expression of a reporter molecule can be detected using the appropriate
reagent, for example, by detecting fluorescence of a green fluorescent protein or light
emission upon addition of luciferin to a luciferase reporter molecule, or by detecting
binding of nickel ion to a polypeptide containing a polyhistidine tag. Furthermore,
the reporter molecule can provide a means of isolating the expressed reporter
molecule or a cell expressing the reporter molecule. For example, where the reporter
molecule is a polypeptide that is expressed on a cell surface and that contains an
operatively linked c-myc epitope, an anti-c-myc epitope antibody can be immobilized
on a solid matrix and cells, some of which express the tagged polypeptide, can be
contacted with the matrix under conditions that allow selective binding of the
antibody to the epitope. Unbound cells can be removed by washing the matrix, and
bound cells, which express the reporter molecule, can be eluted and collected.
Methods for detecting such reporter molecules and for isolating the molecules, or
cells expressing the molecules, are well known to those in the art (see, for example,
Hopp et al., BioTechnology 6:1204, 1988; U.S. Pat. No. 5,011,912; each of which is
incorporated herein by reference).

The present invention also relates to a method of producing a genetically
modified cell that exhibits altered expression of a polypeptide. The method can be
performed, for example, by introducing a synthetic translational regulatory element of
the invention into a cell, whereby the synthetic translational regulatory element is
operatively linked to a nucleotide sequence encoding a polypeptide, thereby
producing a genetically modified cell that exhibits altered expression of a polypeptide.
In one embodiment, the synthetic translational regulatory element is operatively
linked to the nucleotide sequence prior to introducing the synthetic translational
regulatory element into the cell, wherein the synthetic translational regulatory element
has translational enhancing activity or IRES activity. The polypeptide can be any
polypeptide, for example, a reporter polypeptide, a toxin, or a therapeutic agent.
The synthetic translational regulatory element can be stably maintained in the cell, for
example, by integration into the cell genome.

In another embodiment, the nucleotide sequence to which the synthetic
translational regulatory element is operatively linked is a sequence of an endogenous
gene in the cell genome. The synthetic translational regulatory element can have
translational inhibitory activity, translational enhancing activity or IRES activity.
Accordingly, the present invention also relates to a genetically modified eukaryotic
cell produced by a method of the invention, as well as to a transgenic non-human
eukaryotic organism containing or derived from such a genetically modified cell, a
cell or tissue obtained from the transgenic non-human eukaryotic organism, and a
cDNA or genomic DNA library prepared from the transgenic non-human eukaryotic
organism or from a cell or tissue obtained from said transgenic non-human eukaryotic
organism.

The present invention further provides a method of altering translational activity
in a eukaryotic cell, the method comprising introducing into the cell a synthetic
translational regulatory element, whereby the synthetic translational regulatory element
interacts with a translation regulatory factor in the cell, thereby altering translational
activity in the eukaryotic cell. In addition, the invention provides a method of altering
translational activity in a eukaryotic cell, the method comprising introducing into the cell
an isolated translational regulatory element, whereby the isolated translational
regulatory element interacts with a translation regulatory factor in the cell, thereby
altering translational activity in the eukaryotic cell. The translational regulatory element
can have translational enhancing activity or IRES activity, whereby translational activity
in the cell is increased, or can have translational inhibitory activity, whereby
translational activity in the cell is decreased.

The present invention also provides a method of improving protein yield by a
eukaryotic cell. Such a method can be performed, for example, by introducing into
the cell a recombinant nucleic acid molecule containing a translational regulatory
element operatively linked to an expressible polynucleotide, and expressing a
polypeptide encoded by the expressible polynucleotide in the cell. The translational
regulatory element generally has translational enhancing activity or IRES activity. In
one embodiment, the expressible polynucleotide comprises a first cistron encoding a
polypeptide that enhances protein stability or cell viability, and also can comprise a
second cistron, which can encode any polypeptide of interest. Where the expressible
polynucleotide contains two or more cistrons, the second, and any other, cistron is
operatively linked to the first cistron. Preferably, such an expressible polynucleotide
contains one or more IRES elements, which can be operatively linked to the first
cistron or the second or other cistrons or to all of the cistrons, and can be operatively
linked to each other to form a concatemer of IRES elements. In such a method, the
polypeptide that enhances protein stability or cell viability can be any polypeptide, for
example, a chaperone protein, a heat shock protein, a protein having anti-oxidant
activity, a protease inhibitor, a phosphatase inhibitor, a caspase inhibitor, or an
antibiotic peptide such as a magainin, a defensin, or a cryptdin.

In another embodiment, the expressible polynucleotide comprises a first
cistron encoding a polypeptide that enhances transcription or translation of a
polynucleotide in the cell, for example, a transcription factor or a translation
regulatory factor such as eukaryotic initiation factor or ribosomal protein or the like.
The expressible polynucleotide also can comprise a second operatively linked cistron
encoding a polypeptide of interest, which can be under the control of an IRES element
of the invention, and the polypeptide that enhances transcription or translation of a
polynucleotide can be a polypeptide that enhances transcription or translation of the
expressible polynucleotide.

The present invention further relates to a method of expressing a polypeptide
in a eukaryotic cell. Such a method can be performed, for example, by introducing
into the cell a recombinant nucleic acid molecule comprising a translational
regulatory element operatively linked to an expressible polynucleotide, wherein the
translational regulatory element has an activity selected from translational enhancing
activity and IRES activity, and expressing the expressible polynucleotide in the cell.
The expressible polynucleotide can comprise a first cistron encoding a therapeutic
polypeptide or a reporter polypeptide, which can be useful as a diagnostic agent. A
therapeutic polypeptide can be, for example, an immunomodulator such as a cytokine,
a neuromodulator, a hormone such as preproinsulin, a growth factor such as epidermal
growth factor, a growth factor receptor such as a PDGF receptor, an apoptotic
polypeptide such as Bax, an anti-apoptotic polypeptide such as Bcl-2, or an antibiotic.
The expressible polynucleotide also can contain a second cistron encoding a
polypeptide that facilitates expression of the therapeutic or diagnostic polypeptide or
that facilitates activity of the therapeutic polypeptide. A polypeptide that facilitates
expression or activity of such a polypeptide can be, for example, a transcription
factor, which can increase transcription of the expressible polynucleotide; a
translational regulatory factor, which can increase translation of the encoded
polypeptide; a chaperone protein, a protein having anti-oxidant activity, a protease
inhibitor, or a phosphatase inhibitor.

According to a method of the invention, the expressible polynucleotide can be
introduced into a cell ex vivo, for example, into a cell in culture, or can be introduced
into a cell in vivo in a eukaryotic organism, including a mammalian organism such as
a human. As such, the recombinant nucleic acid molecule can be contained in a
vector, which can be a plasmid vector or a viral vector such as an adenovirus vector,
an adeno-associated virus vector, or a retrovirus vector. Such a method can be useful
for treating or diagnosing a pathologic condition, or to monitor the progression of a
pathologic condition.

The present invention also relates to a method of identifying a cell. Such a
method can be performed by introducing into the cell a recombinant nucleic acid
molecule comprising a translational regulatory element of the invention operatively
linked to an expressible polynucleotide, wherein the translational regulatory element
has an activity selected from translational enhancing activity and IRES activity,
wherein the expressible polynucleotide comprises at least one cistron, which encodes
a first reporter polypeptide, and wherein expression of the reporter polypeptide in the
cell provides a means to identify the cell. If desired, the expressible polynucleotide
also can contain a second cistron encoding a polypeptide of interest, in which case the
second cistron is operatively linked to the first cistron, and the expressible
polynucleotide also contains a synthetic IRES element of the invention, which can be
operatively linked to the first cistron or the second cistron or both. The polypeptide of
interest can be any polypeptide, including a second reporter polypeptide, which can be
the same as or different from the first reporter polypeptide.

A method of identifying a cell as disclosed herein can further include a step of
isolating a cell expressing a reporter polypeptide. In one embodiment, the step of
isolating utilizes a reporter polypeptide expressed from the expressible
polynucleotide. Thus, the reporter polypeptide can be, for example, a cell surface
marker, wherein isolating the cell expressing the cell surface marker utilizes an
antibody that specifically binds the cell surface marker. A cell surface marker can be
a polypeptide that is normally expressed on a cell surface, for example, N-CAM, and
the antibody can be an anti-N-CAM antibody, or the cell surface marker can comprise
a peptide tag, which is expressed as part of a cell surface protein, in which case the
cell can be isolated using an antibody specific for the cell surface protein, or an
antibody or other ligand specific for the peptide tag. Accordingly, the present
invention also provides an isolated cell obtained according to a method of the
invention.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 illustrates the structure of Gtx (SEQ ID NO: 1) deletion constructs used
to identify sequences having translational regulatory activity, and shows the relative
IRES activity as determined using the RPh construct (see Example 1). Gtx1-196/RPh
indicates a portion of the RPh plasmid containing a nucleotide sequence encoding the
196 nucleotide Gtx 5' untranslated region (5' UTR). Nucleotides included in the various
constructs are indicated. Relative IRES activity is shown as a percent of the activity of
the full length Gtx 5' UTR. Numbers in parentheses indicate standard error.

Figure 2 shows a portion of the ribonucleotide sequences of various Gtx IRES
module (SEQ ID NO: 2) constructs. SEQ ID NOS: of encoding deoxyribonucleotide
sequences are indicated in parentheses. Nucleotide sequences as shown correspond, for
example, to nucleotides 1 to 27 of SEQ ID NO: 6 (5 copies of wild-type Gtx IRES),
SEQ ID NOS: 17-19, containing 1, 2 or 3 substitutions at the 3' end of each Gtx IRES
module in the repeated unit, and SEQ ID NOS: 20-22, containing 1, 2 or 3 substitutions
at the 5' end of the spacer sequence in each repeated unit. Relative IRES activity as a
ratio of Renilla luciferase activity to Photinus luciferase activity (R:P) is shown,
normalized to R:P activity of the RPh vector, alone.
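
The normalization stated for Figure 2 can be written out as a short calculation: the
R:P ratio of a construct is divided by the R:P ratio of the RPh vector alone. The
sketch below follows the ratio exactly as worded above; the function names and the
luciferase readings are hypothetical values chosen only to make the arithmetic easy
to follow, and are not data from the figures.

```python
def rp_ratio(renilla, photinus):
    """Ratio of Renilla to Photinus luciferase activity (R:P)."""
    if photinus <= 0:
        raise ValueError("Photinus activity must be positive")
    return renilla / photinus


def normalized_ires_activity(construct, vector_alone):
    """R:P of a construct divided by R:P of the vector alone.

    Each argument is a (renilla, photinus) pair of luciferase readings.
    """
    return rp_ratio(*construct) / rp_ratio(*vector_alone)


# Hypothetical readings in arbitrary light units, for illustration only.
construct_reading = (200.0, 50.0)   # R:P = 4.0
vector_reading = (10.0, 5.0)        # R:P = 2.0
print(normalized_ires_activity(construct_reading, vector_reading))  # 2.0
```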



Figure 3 shows regions of complementarity between sequences of yeast
18S ribosomal RNA (SEQ ID NOS: 31, 33, 35, 37 and 39) and oligonucleotide portions
of the yeast YAP1 5' UTR (SEQ ID NOS: 30 and 32) or the yeast p150 5' UTR (SEQ ID
NOS: 34, 36, 38 and 40). Nucleotide sequences of the yeast YAP1 5' UTR (SEQ ID
NO: 28), yeast p150 5' UTR (SEQ ID NO: 29), and 18S ribosomal RNA are indicated.

Figure 4 is a map of the MESVR/EGFP/ECFP/RSVPro(ori) vector (SEQ ID 
NO: 16) showing the positions of unique restriction endonuclease recognition sites. 

Figure 5 shows a region of complementarity of 18S ribosomal RNA (SEQ ID
NO: 50) and a synthetic IRES element, ICS1-23 (SEQ ID NO: 49). "a" and "b" indicate
portions of SEQ ID NO: 49 that were examined separately for IRES activity. SEQ ID
NOS: are indicated in parentheses.

Figure 6 shows deletion constructs of a nucleotide sequence (SEQ ID
NO: 164) encoding the mouse Rbm3 5' UTR, and illustrates a portion of the RPh
vector in which the sequences were examined for IRES activity. Nucleotide positions
relative to the 5' UTR (SEQ ID NO: 164) are indicated in parentheses to the left, and
SEQ ID NOS: are indicated in parentheses to the right of the figure. Adenosine
residues were added to the 3' deleted sequences (SEQ ID NOS: 181-190) such that the
relative distance to the natural Rbm3 AUG start codon was maintained (indicated in
the Figure as, for example, "(A)40"; see SEQ ID NO: 181). Normalized IRES activity of
the various constructs is shown (see description of Figure 4, above).
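
The padding described for the 3' deleted sequences of Figure 6 keeps the construct the
same length as the full 5' UTR, so that the distance to the start codon does not change.
A minimal sketch of that idea follows; the helper name and the 100-nucleotide
placeholder are assumptions for illustration and do not reproduce SEQ ID NO: 164 or
any of SEQ ID NOS: 181-190.

```python
def pad_3prime_deletion(utr, keep_through):
    """Keep nucleotides 1..keep_through and replace the deleted 3' part
    with the same number of adenosine residues."""
    if not 0 <= keep_through <= len(utr):
        raise ValueError("keep_through must lie within the sequence")
    removed = len(utr) - keep_through
    return utr[:keep_through] + "A" * removed


# Hypothetical 100-nt placeholder; deleting the last 40 nt gives "(A)40".
placeholder_utr = "GC" * 50
padded = pad_3prime_deletion(placeholder_utr, 60)
print(len(padded), padded[60:] == "A" * 40)  # 100 True
```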

Figures 7A and 7B illustrate sites in which IRES modules of the invention
share complementarity to mouse 18S ribosomal RNA (rRNA; SEQ ID NO: 196).

Figure 7A provides a linear representation of the 18S rRNA; the vertical lines
below the linear representation are sites at which selected IRES modules share 8 or
9 nucleotides of complementarity with the 18S rRNA sequence.

Figure 7B shows a secondary structure of the 18S rRNA, and the dark bars
indicate the positions of the complementary sequence matches to selected IRES
modules of the invention.
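
Sites of the kind marked in Figure 7A can be located computationally by looking for
stretches of the rRNA that pair with a stretch of an IRES module. The sketch below is
one simple way to do this, under stated assumptions: only contiguous Watson-Crick pairs
(A-U, G-C) in antiparallel orientation are counted, G-U wobble pairs are ignored, and
the sequences are short hypothetical placeholders rather than SEQ ID NO: 196 or any
module of the invention.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the Watson-Crick reverse complement of an RNA string."""
    return rna.translate(COMPLEMENT)[::-1]


def complementary_sites(module, rrna, k=8):
    """Return 1-based rRNA start positions of k-nt windows that are perfectly
    complementary (antiparallel) to some k-nt window of `module`."""
    rc = reverse_complement(module)
    kmers = {rc[i:i + k] for i in range(len(rc) - k + 1)}
    return [i + 1 for i in range(len(rrna) - k + 1) if rrna[i:i + k] in kmers]


# Hypothetical placeholders: the module pairs with rRNA positions 6 to 13.
toy_rrna = "UUUUUGCUAGGCAUUUUU"
toy_module = "UGCCUAGC"
print(complementary_sites(toy_module, toy_rrna, k=8))  # [6]
```

Setting `k=9` gives the stricter 9-nucleotide criterion; a longer run of
complementarity shows up as consecutive start positions in the returned list.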

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a synthetic translational regulatory element,
which includes at least one oligonucleotide consisting of about 6 to 125
ribonucleotides, or a deoxyribonucleotide sequence encoding the oligonucleotide.
The oligonucleotide is characterized, in part, in having translational regulatory
activity, for example, translational enhancing activity, translational inhibitory activity,
internal ribosome entry site (IRES) activity or a combination thereof, and has such
translational regulatory activity in a eukaryotic cell.

As disclosed herein, a synthetic translational regulatory element can be
identified by screening libraries of randomized oligonucleotides and identifying those
having translational enhancing activity, translational inhibitory activity, IRES activity,
or a combination thereof in a eukaryotic cell. In addition, a synthetic translational
regulatory element can be designed based on the nucleotide sequence of a ribosomal
RNA (rRNA; see SEQ ID NOS: 195-197; GenBank Accession Nos. V01335,
X00686, X03205, respectively, each of which is incorporated herein by reference). In
particular, oligonucleotides to be examined for translational regulatory activity,
including IRES activity, can be designed so as to be complementary to an
oligonucleotide sequence of rRNA, preferably an un-base paired oligonucleotide
sequence of rRNA, which is not generally part of a double stranded region that forms
due to hybridization of self-complementary sequence of the rRNA (see Figure 7B).
Furthermore, a synthetic translational regulatory element can be identified by
screening a variegated population of oligonucleotide sequences (see, for example, U.S.
Pat. No. 5,837,500), which can be based, for example, on a translational regulatory
element of the invention as disclosed herein or on an oligonucleotide sequence
complementary to an un-base paired region of a rRNA.
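
The design approach described above, in which candidate oligonucleotides are made
complementary to un-base paired regions of an rRNA, can be sketched as follows. The
sketch assumes the secondary structure is supplied in dot-bracket notation (a "."
marks an unpaired nucleotide), which is a representation chosen here for illustration
and is not specified in the text; the function name and the toy hairpin are likewise
hypothetical. Candidates produced this way would still need to be tested for
translational regulatory activity.

```python
import re

COMPLEMENT = str.maketrans("ACGU", "UGCA")


def candidate_oligos(rrna, structure, min_len=6, max_len=125):
    """Return (1-based start, oligo) pairs, one per unpaired run of at least
    `min_len` nt, where `oligo` is the reverse complement of the run
    (truncated to `max_len` nt of target)."""
    if len(rrna) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    oligos = []
    for match in re.finditer(r"\.{%d,}" % min_len, structure):
        target = rrna[match.start():match.end()][:max_len]
        oligos.append((match.start() + 1, target.translate(COMPLEMENT)[::-1]))
    return oligos


# Hypothetical 16-nt hairpin: a 4-bp stem closing an 8-nt unpaired loop.
toy_rrna = "GGGCAAAUCGAUGCCC"
toy_structure = "((((........))))"
print(candidate_oligos(toy_rrna, toy_structure))  # [(5, 'AUCGAUUU')]
```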

The present invention also provides an isolated translational regulatory element,
which includes at least one 5' untranslated region (5' UTR) of a eukaryotic messenger
RNA (mRNA) or an oligonucleotide portion thereof, or a deoxyribonucleotide sequence
encoding said 5' UTR or oligonucleotide portion thereof. The 5' UTR or oligonucleotide
portion thereof is characterized, in part, in having translational enhancing activity,
internal ribosome entry site (IRES) activity or a combination thereof, and the
translational regulatory activity is effective in a eukaryotic cell.

As used herein, the term "isolated," when used in reference to a translational
regulatory element or a nucleotide sequence encoding the element, indicates that the
nucleotide sequence is in a form other than the form in which it is found in nature.
Thus, a nucleotide sequence encoding an isolated translational regulatory element is
separated, for example, from a gene in which it normally can be found in nature, and
particularly from a chromosome in a cell, and an isolated translational regulatory
element is separated, for example, from an mRNA that normally contains the element.
It should be recognized, however, that a translational regulatory element of the
invention can be linked to additional nucleotide sequences, yet still be considered
"isolated" provided the construct comprising the regulatory element is not in a form
that is found in nature. Thus, the translational regulatory element can be contained
within a vector, or can be operatively linked to a second nucleotide sequence, for
example, another regulatory element or an expressible polynucleotide.

The term "synthetic" also is used herein to refer to a translational regulatory
element of the invention. For convenience, the term "synthetic" is used herein to refer
to a translational regulatory element that is identified by screening a library of
oligonucleotides, which can be randomized oligonucleotides, variegated
oligonucleotides, or the like. In comparison, the term "isolated translational
regulatory element" is used to refer to an element that is identified by examining a
naturally occurring nucleotide sequence, particularly a 5' UTR of an mRNA, which
can be a 5' UTR of a known mRNA, or can be identified by screening a cDNA
library, particularly those portions of the cDNA encoding mRNA 5' UTRs, or by
screening a genomic DNA library. Nevertheless, it should be recognized that the
means of producing or identifying a translational regulatory element of the invention
is not relevant to the composition, activity or use of the element. As such, an isolated
translational regulatory element, having been identified from an mRNA, for example,
can be produced using routine chemical or biochemical methods of nucleic acid
synthesis and, similarly, a library of oligonucleotides used to identify a synthetic
regulatory element can contain oligonucleotides that correspond to translational
regulatory elements that also can be found in nature.

As disclosed herein, the translational regulatory elements of the invention can
have translational enhancing or translational inhibitory activity, and also can have
IRES activity, which provides a means to effect cap-independent translation. In
eukaryotes, the initiation of mRNA translation is believed to occur generally by a
cap-binding/scanning mechanism, although some mRNAs are translated efficiently
despite the lack of a cap structure or a free 5' end (Kozak, Gene 234:187-208, 1999;
Gingras et al., Ann. Rev. Biochem. 68:913-963, 1999). In some of these cases,
sequences contained within the mRNA nucleotide sequence directly recruit the
translation machinery. These sequences, IRESes, are defined functionally using
dicistronic mRNAs, wherein a nucleotide sequence is considered to function as an
IRES if, when present in the intercistronic region of a dicistronic mRNA, it directs
translation of the second cistron in a manner that is independent of the first cistron.
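
The functional definition above lends itself to a simple numerical readout when each cistron encodes a separately assayable reporter. The sketch below is one plausible way to express such a readout and is an assumption rather than the calculation used in the Examples: it takes the ratio of second-cistron to first-cistron reporter signal and normalizes it to the same ratio from a control construct. The function names and the sample readings are hypothetical.

```python
# Sketch only: an assumed normalization for dicistronic reporter readings.

def cistron_ratio(first_cistron: float, second_cistron: float) -> float:
    """Second-cistron signal per unit of first-cistron signal."""
    if first_cistron <= 0:
        raise ValueError("first-cistron reading must be positive")
    return second_cistron / first_cistron


def relative_ires_activity(test, control) -> float:
    """Fold increase of the test construct's ratio over the control's ratio.

    `test` and `control` are (first_cistron, second_cistron) readings."""
    control_ratio = cistron_ratio(*control)
    if control_ratio <= 0:
        raise ValueError("control second-cistron reading must be positive")
    return cistron_ratio(*test) / control_ratio


if __name__ == "__main__":
    # Hypothetical readings in arbitrary light units.
    print(relative_ires_activity(test=(1000.0, 500.0), control=(1000.0, 125.0)))
```

A ratio of this kind says nothing by itself about independence from the first cistron; that requires a separate control, such as blocking translation of the first cistron.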

IRESes have been identified in the 5' UTRs of cellular and viral mRNAs, and
have been extensively characterized in picornaviral mRNAs, where they comprise
well defined segments of about 450 nucleotides (Nicholson et al., J. Virol. 65:5886-
5894, 1991). These IRESes have been categorized on the basis of sequence and
structural similarities (Jackson and Kaminski, RNA 1:985-1000, 1995). In contrast,
the IRES elements of cellular mRNAs are not as well characterized, and in many
cases distinct boundaries have been difficult to determine by deletion analysis
(Stoneley et al., Oncogene 16:423-428, 1998; Huez et al., Mol. Cell. Biol. 18:6178-
6190, 1998; Gan et al., J. Biol. Chem. 273:5006-5012, 1998; Bernstein et al., J. Biol.
Chem. 272:9356-9362, 1997, each of which is incorporated herein by reference).
However, a 55 nucleotide sequence of the Drosophila Antennapedia 5' UTR has been
reported to have IRES activity (Oh et al., Genes Devel. 6:1643-1653, 1992, which is
incorporated herein by reference).



Cellular IRESes do not contain any obvious sequence similarity to each other
or to picornaviral IRES sequences. It has been suggested that IRESes may resemble
each other in secondary structure (Le and Maizel, Nucl. Acids Res. 25:362-369,
1997). RNA folding analyses have indicated that some cellular IRESes may contain a
Y-type stem-loop structure, followed by a stem-loop immediately upstream of the
initiation codon. This Y-shaped conformation was predicted to occur in IRES
sequences of mRNAs encoding the immunoglobulin heavy chain binding protein
(BiP), fibroblast growth factor-2, and the Antennapedia gene product, and is
reportedly similar to one contained within picornavirus, pestivirus, and hepatitis C
virus IRESes (Le et al., Virus Genes 12:135-147, 1996). RNA conformations,
including Y-shaped secondary structures, also have been predicted to occur within
several other cellular IRES sequences (Stoneley et al., supra, 1998; Huez et al., supra,
1998; Sella et al., Mol. Cell. Biol. 19:5429-5440, 1999; Nanbru et al., J. Biol. Chem.
272:32061-32066, 1997). In the c-sis mRNA, for example, three independent
fragments have IRES activity, and each fragment is proposed to contain a Y-type
stem-loop structure (Sella et al., supra, 1999).

No independent physical evidence has demonstrated that the predicted
Y-shaped RNA conformations occur in nature, and for many cellular IRESes, the
ability to internally initiate translation is not correlated with the presence of the
proposed secondary structure. For example, 5' or 3' deletions of the BiP IRES
maintain IRES activity even though the deleted fragments do not contain the predicted
Y-type stem-loop structure (Yang and Sarnow, Nucl. Acids Res. 25:2800-2807, 1997).
This lack of correlation between proposed RNA conformations and IRES activity has
also been observed in other studies. For example, the vascular endothelial growth
factor and c-myc IRESes each contain two non-overlapping fragments with IRES
activity (Stoneley et al., supra, 1998; Huez et al., supra, 1998), and the BiP IRES has
three non-overlapping fragments with IRES activity, none of which contains the
proposed Y-type stem-loop structure, or, at best, contain only part of it (Yang and
Sarnow, supra, 1997). Such observations suggest that cellular IRESes may be
composed of numerous, short segments that have independent IRES activity or that
act together to affect overall IRES activity.

As disclosed herein, analysis of the Gtx 5' UTR (SEQ ID NO: 1) revealed
numerous oligonucleotide portions of the 5' UTR having translational regulatory activity
(see Example 1). Thus, the present invention provides, for example, an isolated
translational regulatory element encoded by any of nucleotides 1 to 40, 1 to 81, 1 to 120,
41 to 81, 14 to 196, 80 to 120, 80 to 196, 120 to 166, and 120 to 196 of the Gtx
homeodomain 5' UTR (SEQ ID NO: 1). Even more remarkably, a 9 nucleotide
sequence of the Gtx homeodomain mRNA 5' UTR (SEQ ID NO: 2; nucleotides 141 of
the 196 nucleotide Gtx 5' UTR; SEQ ID NO: 1) was demonstrated to have the
characteristics of an IRES element (see Example 1; Komuro et al., EMBO J. 12:1387-
1401, 1993, which is incorporated herein by reference). This Gtx IRES module (SEQ



wo 01/55369 



PCT/USOl/02586 



18 

ID NO: 2) functions independently to enhance translational activity, and multiple
copies of the IRES module synergistically enhance translational activity (see
Example 1). Accordingly, the present invention also provides an isolated translational
regulatory element encoded by SEQ ID NO: 2.

As disclosed herein, the Gtx 5' IRES module (SEQ ID NO: 2) can be linked at its
upstream (5') end or downstream (3') end or both ends to a spacer nucleotide sequence,
which can be about 1 to 100 ribonucleotides in length such that the activity of the IRES
module can be varied. Thus, the invention provides, for example, a translational
regulatory element encoded by any of SEQ ID NOS: 4, 5, 12, and 20 to 22, which
includes a Gtx IRES module (SEQ ID NO: 2) and confers variable levels of IRES
activity on an operatively linked expressible polynucleotide.

An isolated translational regulatory element also can contain two or more of the
oligonucleotide portions encoded by SEQ ID NO: 2, which are operatively linked to
each other, for example, five or ten of the oligonucleotides in operative linkage, and each
of the oligonucleotides (SEQ ID NO: 2) can, but need not, be separated from each other
by a spacer nucleotide sequence. As disclosed herein, multiple copies of a Gtx IRES
module exhibit a greater than additive amount of IRES activity (see Example 1). Thus,
the present invention provides, for example, an isolated translational regulatory element
encoded by any of SEQ ID NOS: 6 to 11 and 13 to 15.
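
The arrangement just described, several copies of a short module in operative linkage with an optional spacer between copies, can be sketched as simple string assembly. This is a minimal illustration, not a reproduction of any listed sequence: the module and spacer strings are hypothetical stand-ins (not SEQ ID NO: 2 or any of SEQ ID NOS: 6 to 15), and `build_multimer` is a name invented for the example.

```python
# Sketch only: assemble tandem copies of a module with optional spacers.

def build_multimer(module: str, copies: int, spacer: str = "") -> str:
    """Join `copies` copies of `module`, placing `spacer` between adjacent
    copies only (no spacer is added at either end)."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return spacer.join([module] * copies)


if __name__ == "__main__":
    toy_module = "GGCAUCGAU"   # hypothetical 9-nucleotide module
    toy_spacer = "AAAAAAAAA"   # hypothetical 9-nucleotide spacer
    for n in (1, 2, 5, 10):
        construct = build_multimer(toy_module, n, toy_spacer)
        print(n, len(construct), construct)
```

Varying `copies` and the spacer length in such a design corresponds to the variables the text identifies as affecting the level of IRES activity; the activity of any given design has to be measured experimentally.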

The cooperative increase in IRES activity exhibited by multiple copies of the
Gtx IRES module indicates that cellular IRES activity can depend on the strength,
number and spacing of various individual IRES modules. Photochemical crosslinking
studies previously demonstrated that this region of the Gtx mRNA sequence binds to
40S ribosomal subunits by crosslinking to its complement within the 18S rRNA (Hu
et al., Proc. Natl. Acad. Sci., USA 96:1339-1344, 1999, which is incorporated herein
by reference). As disclosed herein, the 9 nucleotide IRES module is 100%
complementary to the 18S rRNA at nucleotides 1132 to 1124.

A further examination of 5' UTRs of eukaryotic mRNAs revealed that
translational regulatory activity, including IRES activity, was effected by the 5' UTRs of
five dendritically localized mRNAs, including the rat activity related cytoskeletal
protein (ARC; see SEQ ID NO: 25), rat dendrin (see SEQ ID NO: 24), rat
microtubule associated protein-2 (MAP2; see SEQ ID NO: 26), the rat α-Cam
Kinase II subunit (CamK IIa; SEQ ID NO: 23), and the rat RC3 neurogranin gene (see
SEQ ID NO: 27), by two yeast 5' UTRs, yeast YAP1 (see SEQ ID NO: 28) and yeast
p150 (see SEQ ID NO: 29), by the 5' UTRs of mRNAs encoding the rat amyloid
precursor protein (APP; see SEQ ID NO: 161), the rat beta-site APP cleaving enzyme
(Bace; SEQ ID NO: 162), the human FMR protein (see SEQ ID NO: 163), and the
mouse Rbm3 protein (see SEQ ID NO: 164), each of which internally initiated
translation when placed in the intercistronic region of a dicistronic RNA (see
Examples 2 and 3). Accordingly, in one embodiment, the present invention provides
an isolated translational regulatory element encoded by a nucleotide sequence selected
from any of SEQ ID NOS: 1, 23 to 29, and 161 to 164.

Detailed analysis of two of the 5' UTRs, yeast p150 and mouse Rbm3, confirmed
the results obtained for the Gtx 5' UTR. Thus, oligonucleotide portions of the yeast p150
5' UTR, including, for example, those encoded by any of nucleotides 1 to 250, 100 to
508, 160 to 508, 250 to 508, 375 to 508, 429 to 508, 481 to 508, and 250 to 390 of
SEQ ID NO: 29 (see Example 3), and oligonucleotide portions of the mouse Rbm3
5' UTR, including, for example, those encoded by any of SEQ ID NOS: 165 to 169,
171 to 176, and 182 to 192 (see Example 6 and Figure 6), are examples
of isolated translational regulatory elements of the invention that have translational
enhancing activity or IRES activity or both. In addition, examples of oligonucleotide
portions of a eukaryotic mRNA 5' UTR having translational inhibitory activity include
those encoded by nucleotides 120 to 196 and 167 to 196 of the Gtx homeodomain
protein mRNA (SEQ ID NO: 1), and by nucleotides 1 to 100 and 100 to 160 of the yeast
p150 mRNA (SEQ ID NO: 29).

Analysis of the mouse Rbm3 5' UTR further revealed modular elements
consisting of about 9 to 22 nucleotides, which exhibited IRES activity (see SEQ ID
NO: 191), translational enhancing activity (see SEQ ID NO: 192), and translational
inhibitory activity (see SEQ ID NOS: 193 and 194). These results indicate that the
5' UTRs of eukaryotic mRNAs consist of a modular structure containing one or more
elements that variously effect translational regulatory activity. Accordingly, the present
invention also provides oligonucleotide portions of a 5' UTR of a eukaryotic mRNA that
have translational enhancing activity, translational inhibitory activity, IRES activity, or a
combination thereof, including, for example, SEQ ID NOS: 192 to 194.

As further disclosed, screening of libraries of random oligonucleotides
containing 9 or 18 nucleotides identified numerous novel oligonucleotides having
IRES activity, including combinations of such oligonucleotides. Accordingly, the
present invention provides a synthetic translational regulatory element, which
includes at least one oligonucleotide consisting of about 6 to 125 ribonucleotides, or a
deoxyribonucleotide sequence encoding the oligonucleotide. For example, the invention
provides a synthetic translational regulatory element encoded by any of SEQ ID NOS: 42
to 46, 49, 50, 52, and 89 to 160, which were identified by screening libraries of
randomized oligonucleotides (Example 4D). Furthermore, as disclosed herein, such
oligonucleotides having translational regulatory activity can be operatively linked to
each other, and can be the same or different. Examples of synthetic translational
regulatory elements containing operatively linked oligonucleotides that have
translational regulatory activity and are different from each other are exemplified by
those encoded by any of SEQ ID NOS: 53 to 88 (see Example 4D), and examples of
synthetic translational regulatory elements containing operatively linked
oligonucleotides that have translational regulatory activity and are the same are
exemplified by those encoded by elements containing 3 or 5 copies of SEQ ID NO: 51
or 5 copies of SEQ ID NO: 52, each copy separated by a 9 nucleotide spacer sequence
(see Example 4C).

The identification of these modular IRES elements such as the 9 nucleotide
Gtx IRES module, which functions independently as an IRES, is inconsistent with the
hypothesis that Y-shaped RNA conformations are required for IRES activity in
cellular mRNA (see, for example, Le and Maizel, supra, 1997). While RNA
conformation may be critical for the activity of some cellular IRESes, as it is for
picornavirus IRES elements, the present results suggest that RNA secondary structures
may simply alter the accessibility or presentation of individual IRES modules to the
translation machinery.

A nucleotide sequence including the Gtx IRES module (SEQ ID NO: 2)
directly bound to 40S ribosomal subunits by base pairing to the 18S rRNA (Hu et al.,
supra, 1999). When tested in the 5' UTR of a monocistronic reporter construct, this
nucleotide sequence inhibited translation, and mutations that increased the degree of
complementarity to the rRNA decreased the translation of a monocistronic mRNA,
whereas mutations that decreased complementarity increased translation (Tranque et
al., Proc. Natl. Acad. Sci., USA 95:12238-12243, 1998, which is incorporated herein
by reference). Nucleotide sequences of the yeast YAP1 and yeast p150 5' UTRs also
share complementarity with 18S rRNA sequences (see Figure 5). As disclosed herein,
the Gtx IRES module (SEQ ID NO: 2) also functions as an IRES element when
positioned in the intercistronic region of a dicistronic mRNA. As such, this element
has opposite effects on translation, depending on whether it is present in a
monocistronic mRNA or in a dicistronic mRNA. Very active IRES modules likely
recruit the 40S subunit efficiently, but allow it to detach for protein synthesis to occur,
whereas less active IRES modules either may recruit poorly because of weak initial
interactions with the 40S subunit or may form very stable interactions that effectively
sequester the mRNA.

The results disclosed herein indicate that base pairing between segments of
mRNA and 18S rRNA can lead to ribosome recruitment and translation initiation,
thus explaining an earlier observation that large numbers of eukaryotic mRNAs
contain segments complementary to 18S rRNA (Mauro and Edelman, Proc. Natl.
Acad. Sci., USA 94:422-427, 1997, which is incorporated herein by reference). If
particular subsets of these mRNA sequences function as IRES modules, then large
numbers of different mRNAs may use these sequences to recruit ribosomes.
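
Complementarity of the kind discussed above can be quantified by sliding a short mRNA segment along an rRNA sequence and counting Watson-Crick pairs in antiparallel orientation. The sketch below is illustrative only and rests on stated assumptions: it counts only A-U and G-C pairs (G-U wobble pairs are ignored), the sequences in the demonstration are hypothetical placeholders rather than any sequence of the listing, and the function names are invented. The pairing site is reported from the higher to the lower rRNA position, the same convention used above for the Gtx module.

```python
# Sketch only: score antiparallel Watson-Crick complementarity of a short
# mRNA segment against every window of an rRNA. G-U wobble is not counted.

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def percent_complementarity(segment: str, target: str) -> float:
    """Percent of positions that pair when `segment` (5'->3') is aligned
    antiparallel to `target` (5'->3'); both must have the same length."""
    if len(segment) != len(target):
        raise ValueError("segment and target must have equal length")
    pairs = sum((a, b) in WATSON_CRICK for a, b in zip(segment, reversed(target)))
    return 100.0 * pairs / len(segment)


def best_site(segment: str, rrna: str):
    """Return (percent, high_position, low_position) for the best-scoring
    window; positions are 1-based and the first best window wins ties."""
    n = len(segment)
    if n == 0 or n > len(rrna):
        raise ValueError("segment must be non-empty and no longer than rrna")
    best_score, best_start = -1.0, 0
    for i in range(len(rrna) - n + 1):
        score = percent_complementarity(segment, rrna[i:i + n])
        if score > best_score:
            best_score, best_start = score, i + 1
    return best_score, best_start + n - 1, best_start


if __name__ == "__main__":
    toy_rrna = "GGAUCCGAAUUCGGCAUGCAAGCUU"  # hypothetical, not a real 18S rRNA
    toy_segment = "CGAAUUCGG"               # hypothetical 9-nucleotide segment
    print(best_site(toy_segment, toy_rrna))
```

A high score identifies a candidate pairing site only; whether a given complementary segment actually functions as an IRES module is an experimental question.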

Sequences complementary to 18S rRNA also are present in other cellular IRESes
(Gan et al., supra, 1998; Bernstein et al., supra, 1997). Similarly, the picornaviral
IRES elements contain an oligopyrimidine stretch that is complementary to the 3' end
of the 18S rRNA (Scheper et al., FEBS Lett. 352:271-273, 1994). While this
oligopyrimidine sequence is important for IRES activity (Meerovitch et al., J. Virol.
65:5895-5901, 1991; Pestova et al., J. Virol. 65:6194-6204, 1991), it has not been
shown to base pair to the 18S rRNA. Such sequences are not found in the hepatitis C,
classic swine fever, and bovine diarrhea pestivirus IRESes, although reconstitution
studies provide indirect evidence for base pairing interactions between these IRESes
and 18S rRNA (Pestova et al., Genes Devel. 12:67-83, 1998; Pestova and Hellen,
Virology 258:249-256, 1999).

In addition to complementarity to 18S rRNA, IRES modules also may recruit
ribosomes through interactions with ribosomal proteins or with components of the
translation machinery such as initiation factors, or through other intermediary
proteins. For example, the iron regulatory element (IRE), which is an RNA
translational control sequence, functioned as an IRES module in the presence of an
engineered intermediary protein (De Gregorio et al., EMBO J. 18:4865-4874, 1999).
In this system, the IRE binding protein (IRP-1), which binds the IRE, was fused to the
C-terminal region of initiation factor eIF4G to produce a fusion protein that acts as an
intermediary protein that can recruit the preinitiation complex, presumably through an
interaction with initiation factor eIF3. This artificial recruitment of the translation
machinery was sufficient to obtain weak but detectable IRES activity when the IRE
was inserted in the intercistronic region of a dicistronic mRNA (De Gregorio et al.,
supra, 1999).

The ability to internally initiate translation may be used by some mRNAs
under conditions that are not favorable for cap-dependent translation, for example,
during mitosis or poliovirus infection (Johannes et al., Proc. Natl. Acad. Sci., USA
96:13118-13123, 1999). However, in contrast to many viral mRNAs, virtually all
eukaryotic mRNAs are monocistronic and contain a cap structure (Kozak, supra,
1999; Shatkin, Cell 9:645-653, 1976). Thus, the ability to internally initiate
translation may be a reflection of the ability of a sequence to recruit ribosomes, and if
the recruitment is sufficient to enhance translation, these sequences can confer a
selective advantage to some mRNAs over others under competitive situations within
the cell. IRES modules in eukaryotic mRNAs also can be mechanistically analogous
to the prokaryotic Shine and Dalgarno sequence, albeit more complex because they
are combined as multiple elements and the complementary sequences are not fixed in
location either within the mRNA or rRNA.

As disclosed herein, two, five or ten copies of the 9 nucleotide Gtx IRES
module (SEQ ID NO: 2) and of two synthetic oligonucleotides (SEQ ID NOS: 51
and 52) have a synergistic effect on IRES activity (Examples 1 and 4). This effect is
similar to that observed with cis-acting transcriptional enhancers (Busby et al., J. Mol.
Biol. 241:341-352, 1994), suggesting that the evolution of translational enhancers and
regulators may parallel that of cis-acting transcriptional enhancers. It has been
suggested that the synergistic effect of transcriptional enhancers may be due to
cooperative interactions between weak transcription factors, thus increasing the local
concentration of these factors (Ptashne and Gann, Curr. Biol. 8:R812-R822, 1998;
Ptashne and Gann, Nature 386:569-577, 1997). Although the mechanism for the
observed synergistic effect of translational enhancement is not relevant to the present
invention, it is noted that, similar to synergistic transcriptional enhancement, weak
interactions between components of the translational machinery may explain the
synergistic effects disclosed herein. Alternatively, or in addition, increased
recruitment may lead to higher local concentrations of some of the initiation factors
that otherwise are limiting for translation. For example, high levels of eIF4A, which
is an RNA helicase, may maintain a more open mRNA secondary structure and allow
more efficient translation.

As disclosed herein, a translational regulatory element of the invention is
useful for modulating the expression of a polypeptide. Synthetic IRESes of the
invention were up to 63-fold more active than the EMCV IRES and appear to be
much stronger than most or all of the naturally occurring IRESes that have been
characterized (Borman et al., Nucl. Acids Res. 25:925-932, 1997; Martinez-Salas,
Curr. Opin. Biotech. 10:458-465, 1999). In particular, methods of finely tuning the
level of IRES activity by varying the number of operatively linked IRES modules are
provided. Thus, the compositions and methods of the invention are useful, for
example, for gene therapy, protein manufacturing, and the like.

The 5' UTRs from five dendritically localized mRNAs, ARC, dendrin, MAP2,
CamK IIa, and RC3, internally initiated translation when placed in the intercistronic
region of a dicistronic RNA (Example 2). Furthermore, these 5' UTRs utilize both
cap-independent and cap-dependent mechanisms simultaneously, resulting in the
increased translation of monocistronic reporter mRNAs. Such IRES elements provide
a means to ensure efficient translation of the dendritically localized transcripts.

A small number of mRNAs are localized to dendrites and the translation of
some or all of these mRNAs is likely required to affect synaptic efficacy. Inasmuch
as many of the components of the translational machinery can be limiting at these
cellular sites, dendritically localized mRNAs were examined for internal ribosomal
entry sites (IRES elements), which can increase translation efficiency. The 5' UTRs
of the dendritically localized mRNAs were tested for IRES activity in the
intercistronic region of dual luciferase dicistronic mRNAs. Transfection of neural and
non-neural cell lines with constructs containing the 5' UTRs from the dendritically
localized mRNAs and the encephalomyocarditis virus (EMCV) IRES resulted in
translation of the second cistron that was significantly higher than that observed for a
control construct containing the β-globin 5' UTR. Translation facilitated by these
IRES elements was independent of the first cistron; translation of the second cistron
occurred even when translation of the first cistron was blocked. Each of the ARC,
MAP2 and CamK IIa 5' UTRs functioned as a translational enhancer in a
monocistronic reporter mRNA, whereas the dendrin 5' UTR slightly inhibited
translation compared to a control construct. Blocking of cap-dependent translation in
these monocistronic constructs showed that internal initiation accounted for about
60% to 90% of the translation initiation of these mRNAs.

The ability of the dendritically localized 5' UTRs to direct internal initiation of
translation was demonstrated using a dicistronic RNA containing two separately
assayable luciferase reporter genes in three neural cell lines and three non-neural cell
lines (Example 2). When introduced into the intercistronic region of the dual
luciferase dicistronic message, each of the 5' UTRs significantly induced translation
of the second cistron. Furthermore, this activity was not inhibited by the presence of
a hairpin structure at the 5' end of the dicistronic message, which effectively reduces
the possibility of read-through translation from one cistron to the next, thus
demonstrating that the 5' UTRs direct internal initiation of translation in a manner
characteristic of an IRES. The ARC, dendrin, MAP2 and CamK IIa 5' UTRs
generated significant levels of IRES activity and internally initiated translation in
neural and non-neural cells and, therefore, can be used to modulate translation in a
variety of different cell types. In addition, while the relative IRES activity generated
by the 5' UTRs in non-neural cells was similar to that observed for the neural cell
lines, the overall levels of IRES activity differed across cell types, and were
consistently higher in non-neural cell lines.

The translation efficiency of monocistronic mRNAs is believed to be a
function of the length of the 5' UTR, combined with other factors such as structure
and upstream initiation codons. The β-globin 5' UTR is considered to be the standard
of cap-dependent translation, being an optimal length, having very little secondary
structure, and having no upstream AUG codons. In comparison, the lengths of the
5' UTRs for MAP2, CamK IIa and ARC average about 3 times the length of the
β-globin 5' UTR, yet monocistronic mRNAs containing these 5' UTRs all were
translated significantly better than that containing β-globin. However, the dendrin
5' UTR inhibited translation of monocistronic RNAs. Inhibition of translation has
been observed for some viral IRES elements positioned 5' to monocistronic reporter
mRNAs, and can be due to the secondary structure associated with the viral IRES
inhibiting ribosomal scanning from the cap. As there is no evidence of a change in
mRNA stability, the dendrin 5' UTR may have more secondary structure than the
other 5' UTRs examined here. Several 5' UTRs also have been shown to be important
in the developmental regulation of translation, and many of these function through
IRES elements.

Since all monocistronic mRNAs should be equally capable of utilizing
cap-dependent translation, the differences in translation of the monocistronic mRNAs
observed herein likely are intrinsic to each 5' UTR. Since mRNA stability was not
affected by the 5' UTRs (Example 2), the IRES elements contained in these 5' UTRs
likely modulate translation of the monocistronic mRNAs. For the MAP2, ARC, and
CamK IIa 5' UTRs, IRES activity can supplement cap-dependent translation to
provide higher levels of translation. Monocistronic reporter gene mRNAs, although
more physiologically relevant than dicistronic mRNAs, cannot discriminate as to
whether the 5' UTRs from the dendritically localized mRNAs can employ
cap-independent translational mechanisms.

The incorporation of a stable hairpin structure to prevent cap-dependent
translation via ribosomal scanning of the mRNAs confirmed that all four of the
5' UTRs utilize internal initiation in monocistronic mRNAs (Example 2). In addition,
the differential effects of the hairpin on the monocistronic mRNAs demonstrated that
varying degrees of cap-dependent and cap-independent mechanisms occur for each
5' UTR. For example, the dendrin 5' UTR which, unlike the other three 5' UTRs, did
not increase the translation of non-hairpin monocistronic mRNA, exhibited one of the
strongest effects when cap-dependent translation was inhibited. These results indicate
that the dendrin 5' UTR directs translation primarily through cap-independent
mechanisms. The absence of cap-dependent translation explains the observation that
the dendrin 5' UTR was below control values in the monocistronic construct. The
ability of a 5' UTR to effect both cap-dependent and cap-independent translation
indicates that an mRNA containing such a 5' UTR is more likely to be translated than
a cap-dependent mRNA such as β-globin, particularly under conditions where
cap-dependent translation is compromised. As such, these 5' UTRs can provide a means
to ensure translation of these dendritically localized mRNAs.

In viral RNAs, which lack a cap, IRES elements likely serve to circumvent
cap-dependent recruitment of the translational machinery to the 5' UTR. However,
since most eukaryotic mRNAs contain a cap structure, IRES elements can serve other
functions in eukaryotes. In particular, a eukaryotic IRES can ensure the translation of
a particular message under conditions where cap-dependent translation is inefficient
or inhibited. For several eukaryotic IRES elements that have been described, stress
and other cellular states that generally hinder cap-dependent protein synthesis have
little or no effect on IRES mediated translation. A similar case may occur for the
dendritically localized mRNAs, which, as disclosed herein, contain 5' UTRs that direct
cap-dependent and cap-independent translation (Example 2). Furthermore,
cap-dependent translation is tightly regulated by cellular events and often involves
modulation of cap-binding activity. As such, an mRNA containing an IRES can be
translated efficiently during a cellular stress condition such as ischemia or apoptosis,
which otherwise affects overall levels of translation.

Experiments using a dominant negative form of the eukaryotic initiation
factor-4 (eIF4) binding protein, eIF4-bp1, demonstrated that the 5' UTRs from the
dendritically localized mRNAs initiated translation under conditions where cap
binding is inhibited. Endogenous 4E-BP1 binds to and sequesters eIF4E, disabling
cap-dependent translation until 4E-BP1 is phosphorylated. The 4E-BP1 mutant,
which lacks the critical phosphorylation sites necessary for disengagement with
eIF4E, irreversibly binds eIF4E and prevents cap-dependent translation. Transfection
of this mutant 4E-BP1 with monocistronic 5' UTR constructs reduced the
cap-dependent translation from the 5' UTR of β-globin by greater than 55% (lack of
further reduction of β-globin can be related to plasmid titer, lack of co-transfection in
some cells, or feedback on the 4E-BP1 mutant's own capped message), whereas
constructs containing the 5' UTRs initiated translation of monocistronic mRNAs to
various degrees in the absence of cap-binding. The lack of dependence of the dendrin
5' UTR on cap-binding for translation supports the previous observation that this 5' UTR
is primarily translated through cap-independent mechanisms. These results indicate
that IRES mediated translation can yield constitutive expression of certain proteins,
regardless of cap-dependent translational control. Furthermore, the degree to which
the IRES can internally initiate translation in the absence of cap-binding provides an
additional level of translational regulation, including providing a means for
maintaining translation of cellular messages critical for important cellular events.




Although it is not clear whether cap-independent mechanisms are utilized
within the dendritic compartment, each of the five 5' UTRs, which were selected
based on their mRNA subcellular distribution, demonstrated IRES activity to some
degree. Remarkably, these 5' UTRs do not share characteristics of other IRES
elements, were not selected, for example, based on length, exceptional GC content, or
excess of upstream AUG start codons, and have no overall sequence similarity. The
5' UTR from the neural cell adhesion molecule (NCAM) mRNA, a neuronal message
that is not dendritically localized, did not demonstrate any significant IRES activity.
It has been reported that cap-analogs, which prevent cap-dependent protein synthesis,
can perturb lasting changes in long-term depression. Those observations, together with
the results disclosed herein, suggest that dendrites may utilize internal initiation as a
mechanism for translation of dendritically localized messages, and may do so in
response to synaptic stimuli or other perturbations in cellular homeostasis.

As disclosed herein, a translational regulatory element of the invention can be
contained in a vector. The vector can be any vector, but preferably is an expression
vector, which can contain a translation initiation site in addition to the translational
regulatory element. Generally, the translational regulatory element will have
translational enhancing activity or IRES activity, particularly where the vector is to
further contain an expressible polynucleotide. However, the vector also can contain a
translational regulatory element of the invention having translational inhibitory activity,
for example, where it is desired to introduce the element into a cell to inhibit translation
of a polypeptide to which it is operatively linked, or where it is desired to use the
element to bind translational regulatory factors that would otherwise bind an endogenous
form of such an element and inhibit translation.

Where the vector is an expression vector, it can further contain a translation start
codon, which can be operatively linked to a translation initiation site and to the
translational regulatory element. However, such elements also can be part of an
expressible polynucleotide that is to be inserted into the vector. Accordingly, a vector of
the invention can contain an expressible polynucleotide, wherein the translational
regulatory element is operatively linked to the expressible polynucleotide. As used
herein, the term "expressible polynucleotide" refers to any polynucleotide that can be
transcribed into an RNA molecule and, particularly, can be translated into a polypeptide.
The expressible polynucleotide can simply be a coding sequence of a polypeptide, or can
contain one or more regulatory elements generally associated with a polynucleotide to be
transcribed or translated, for example, a transcriptional promoter, transcription start site,
polyadenylation signal sequence, Kozak sequence, initiator methionine codon, stop
codon and the like. Where the expressible polynucleotide contains all or substantially all
of the elements required for expression of a polypeptide, it also is referred to herein as a
"cistron". In referring to "substantially" all of the elements required for expression, it will
be recognized that a cistron may or may not contain a translational regulatory element,
since such an element is provided according to the present invention.

It should further be recognized that an expressible polynucleotide can contain
more than one cistron, for example, can be dicistronic such as the numerous dicistronic
expressible polynucleotides described in the Examples. In general, where an expressible
polynucleotide contains more than one cistron, the cistrons are operatively linked such
that they are transcribed as a single RNA molecule. In addition, a spacer nucleotide
sequence generally will be present between the cistrons, and, if desired, a translational
regulatory element of the invention having IRES activity or translational enhancing
activity or both can be inserted into the spacer nucleotide sequence. As disclosed herein,
an expressible polynucleotide can be contained in a vector of the invention.

A vector of the invention also can contain additional elements generally found in
a vector, including, for example, a nucleotide sequence encoding a polypeptide that
confers antibiotic resistance upon a cell containing the vector, including a bacterial cell,
an insect cell, or a mammalian cell, as desired, and can contain an origin of replication,
for example, a bacterial origin of replication such that the vector can be amplified in a
bacterial host cell. Accordingly, the invention also provides a host cell containing a
vector of the invention. Vectors, which include plasmid vectors and viral vectors such
as bacteriophage, baculovirus, retrovirus, lentivirus, adenovirus, vaccinia virus,
semliki forest virus and adeno-associated virus vectors, and host cells for containing a
particular vector, are well known and can be purchased from a commercial source
(Promega, Madison WI; Stratagene, La Jolla CA; Invitrogen, La Jolla CA) or can be
constructed by one skilled in the art (see, for example, Meth. Enzymol., Vol. 185,
Goeddel, ed. (Academic Press 1990); Jolly, Canc. Gene Ther. 1:51-64, 1994;
Flotte, J. Bioenerg. Biomemb. 25:37-42, 1993; Kirshenbaum et al., J. Clin. Invest.
92:381-387, 1993; each of which is incorporated herein by reference).

Where a translational regulatory element of the invention is to be introduced
into a cell in culture or in an organism, a viral vector can be particularly useful. Viral
vectors provide the advantage that they can infect host cells with relatively high
efficiency and can infect specific cell types. For example, a translational regulatory
element of the invention operatively linked to an expressible polynucleotide that is to
be expressed in lung cells can be cloned into an adenovirus vector, which effectively
infects lung epithelial cells. Similarly, a translational regulatory element of the
invention operatively linked to an expressible polynucleotide that is to be expressed in
neuronal cells can be cloned into a herpesvirus vector, which effectively infects
neuronal cells. Viral vectors have been developed for use in particular host systems,
particularly mammalian systems, and include, for example, retroviral vectors, other
lentivirus vectors such as those based on the human immunodeficiency virus (HIV),
adeno-associated virus vectors, herpesvirus vectors, vaccinia virus vectors, and the
like (see Miller and Rosman, BioTechniques 7:980-990, 1992; Anderson et al., Nature
392:25-30 Suppl., 1998; Verma and Somia, Nature 389:239-242, 1997; Wilson, New
Engl. J. Med. 334:1185-1187 (1996), each of which is incorporated herein by
reference).

A translational regulatory element of the invention, which can be contained in
a vector, can be introduced into a cell by any of a variety of methods known in the art
(Sambrook et al., Molecular Cloning: A laboratory manual (Cold Spring Harbor
Laboratory Press 1989); Ausubel et al., Current Protocols in Molecular Biology, John
Wiley and Sons, Baltimore, MD (1987, and supplements through 1995), each of
which is incorporated herein by reference). Such methods include, for example,
transfection, lipofection, microinjection, electroporation and, with viral vectors,
infection; and can include the use of liposomes, microemulsions or the like, which can
facilitate introduction of the polynucleotide into the cell and can protect the
polynucleotide from degradation prior to its introduction into the cell. The selection
of a particular method will depend, for example, on the cell into which the
translational regulatory element is to be introduced, as well as whether the cell is
isolated in culture, or is in a tissue or organ in culture or in situ, and, as disclosed
herein, the selection system can be based on a reporter polypeptide that is expressed,
at least in part, due to the translational regulatory element of the invention.
Introduction of a translational regulatory element of the invention into a cell by
infection with a viral vector is particularly advantageous in that it can efficiently
introduce the nucleic acid molecule into a cell ex vivo or in vivo (see, for example,
U.S. Pat. No. 5,399,346, which is incorporated herein by reference).

The present invention further relates to a recombinant nucleic acid molecule,
which contains a translational regulatory element of the invention operatively linked to
an expressible polynucleotide. The expressible polynucleotide can comprise a cistron,
and the translational regulatory element can have translational enhancing activity or IRES
activity or both. The expressible polynucleotide also can comprise, in operative linkage
in a 5' (upstream) to 3' (downstream) orientation, a first cistron, a spacer nucleotide
sequence, and a second cistron, and the translational regulatory element can have IRES
activity, and be operatively linked to the second cistron. The synthetic translational
regulatory element in a recombinant nucleic acid molecule of the invention can include
two or more operatively linked oligonucleotides having translational regulatory activity,
for example, ten copies of SEQ ID NO: 2, each of which can be separated by a spacer
nucleotide sequence.
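
By way of illustration only, the arrangement of two or more operatively linked oligonucleotides separated by spacer nucleotide sequences can be sketched as follows. The element and spacer sequences shown are hypothetical placeholders, not SEQ ID NO: 2 or any other sequence disclosed herein, and the function name is an assumption made for this sketch.

```python
def build_concatemer(element: str, spacer: str, copies: int) -> str:
    """Join `copies` of `element`, placing `spacer` between adjacent copies."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return spacer.join([element] * copies)


# Hypothetical placeholder sequences, chosen only for illustration.
ELEMENT = "ACGUACGUA"
SPACER = "UUUUUUUUU"

# Ten copies of the placeholder element, each pair separated by one spacer.
concatemer = build_concatemer(ELEMENT, SPACER, copies=10)
print(len(concatemer))
```

Ten copies of an element require nine intervening spacers, so the length of the result is ten element lengths plus nine spacer lengths.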

The polypeptide encoded by a recombinant nucleic acid molecule of the
invention can be any polypeptide, including, for example, an enzyme such as
β-galactosidase, β-glucuronidase, luciferase, alkaline phosphatase, glutathione
S-transferase, chloramphenicol acetyltransferase, guanine xanthine
phosphoribosyltransferase, and neomycin phosphotransferase; a viral or bacterial
polypeptide or an epitope expressed by a virus, bacterium or other pathogenic
organism; or a growth factor, a hormone, or a receptor for a growth factor or a
hormone. The recombinant nucleic acid molecule also can encode two or more
polypeptides, for example, a first polypeptide that is a reporter polypeptide,
which can be a selectable marker, and a second polypeptide that is a polypeptide of
interest, for example, a therapeutic polypeptide.

The term "oligonucleotide", "polynucleotide" or "nucleotide sequence" is used
broadly herein to mean a sequence of two or more deoxyribonucleotides or
ribonucleotides that are linked together by a phosphodiester bond. As such, the terms
include RNA and DNA, which can be a gene or a portion thereof, a cDNA, a
synthetic polydeoxyribonucleic acid sequence or polyribonucleic acid sequence, or
the like, and can be single stranded or double stranded, as well as a DNA/RNA
hybrid. Furthermore, the terms "oligonucleotide", "polynucleotide" and "nucleotide
sequence" include naturally occurring nucleic acid molecules, which can be isolated
from a cell, as well as synthetic molecules, which can be prepared, for example, by
methods of chemical synthesis or by enzymatic methods such as by the polymerase
chain reaction (PCR).

For convenience of discussion, the term "oligonucleotide" generally is used to
refer to a nucleotide sequence that has translational regulatory activity, particularly
a nucleotide sequence of a translational regulatory element of the invention. In
comparison, the term "polynucleotide" or "nucleotide sequence" generally refers to a
sequence that encodes a peptide or polypeptide, acts as or encodes a desired
regulatory element, provides a spacer sequence or cloning site, or the like. It should
be recognized, however, that such a use only is for convenience and is not intended to
suggest any particular length or other physical, chemical, or biological characteristic
of the nucleic acid molecule.

Synthetic methods for preparing a nucleotide sequence include, for example,
the phosphotriester and phosphodiester methods (see Narang et al., Meth. Enzymol.
68:90, (1979); U.S. Pat. No. 4,356,270, U.S. Pat. No. 4,458,066, U.S. Pat.
No. 4,416,988, U.S. Pat. No. 4,293,652; and Brown et al., Meth. Enzymol. 68:109,
(1979), each of which is incorporated herein by reference). In various embodiments,
an oligonucleotide of the invention or a polynucleotide useful in a method of the
invention can contain nucleoside or nucleotide analogs, or a backbone bond other than
a phosphodiester bond. The nucleotides comprising an oligonucleotide
(polynucleotide) generally are naturally occurring deoxyribonucleotides, such as
adenine, cytosine, guanine or thymine linked to 2'-deoxyribose, or ribonucleotides
such as adenine, cytosine, guanine or uracil linked to ribose. However, a
polynucleotide also can contain nucleotide analogs, including non-naturally occurring
synthetic nucleotides or modified naturally occurring nucleotides. Such nucleotide
analogs are well known in the art and commercially available, as are polynucleotides
containing such nucleotide analogs (Lin et al., Nucl. Acids Res. 22:5220-5234 (1994);
Jellinek et al., Biochemistry 34:11363-11372 (1995); Pagratis et al., Nature
Biotechnol. 15:68-73 (1997), each of which is incorporated herein by reference).

The covalent bond linking the nucleotides of an oligonucleotide or
polynucleotide generally is a phosphodiester bond. However, the covalent bond also
can be any of numerous other bonds, including a thiodiester bond, a phosphorothioate
bond, a peptide-like bond or any other bond known to those in the art as useful for
linking nucleotides to produce synthetic polynucleotides (see, for example, Tam et al.,
Nucl. Acids Res. 22:977-986 (1994); Ecker and Crooke, BioTechnology 13:351-360
(1995), each of which is incorporated herein by reference). The incorporation of
non-naturally occurring nucleotide analogs or bonds linking the nucleotides or
analogs can be particularly useful where the nucleotide sequence is to be exposed to
an environment that can contain a nucleolytic activity, including, for example, a tissue
culture medium or upon administration to a living subject, since the modified
nucleotide sequences can be less susceptible to degradation.

A polynucleotide comprising naturally occurring nucleotides and
phosphodiester bonds can be chemically synthesized or can be produced using
recombinant DNA methods, using an appropriate polynucleotide as a template. In
comparison, a polynucleotide comprising nucleotide analogs or covalent bonds other
than phosphodiester bonds generally is chemically synthesized, although an enzyme
such as T7 polymerase can incorporate certain types of nucleotide analogs into a
polynucleotide and, therefore, can be used to produce such a polynucleotide
recombinantly from an appropriate template (Jellinek et al., supra, 1995).



A recombinant nucleic acid molecule, or a vector, of the invention can contain
additional sequences that confer a desired characteristic on the polynucleotide. In
particular, the recombinant nucleic acid molecule or vector can contain an operatively
linked transcriptional regulatory element, including a transcriptional promoter or
enhancer, which can be a constitutively expressed regulatory element, which
maintains expression of a polynucleotide at a relatively constant level of activity
(basal level), or can be an inducible regulatory element. Constitutively expressed
regulatory elements can be expressed in any cell type, or can be tissue specific, which
are expressed only in particular cell types, phase specific, which are expressed only
during particular developmental or growth stages of a plant cell, or the like. A
regulatory element such as a tissue specific or phase specific regulatory element or an
inducible regulatory element useful in constructing a recombinant polynucleotide or
in practicing a method of the invention can be a regulatory element that generally, in
nature, is found in a plant genome. However, the regulatory element also can be from
an organism other than a plant, including, for example, from a plant virus, an animal
virus, or a cell from an animal or other multicellular organism.

As disclosed herein, translational regulatory elements can be selected based on
their ability to confer tissue specific translation, or translation in response to an
inducing agent. As used herein, the term "tissue specific," when used in reference to a
translational regulatory element, means a nucleotide sequence that effects translation
of an operatively linked expressible polynucleotide in only one or a few cell types.
As used herein, the term "inducible," when used in reference to a translational
regulatory element, means a nucleotide sequence that, when present in a cell exposed
to an inducing agent, effects an increased level of translation of an operatively linked
expressible polynucleotide as compared to the level of translation, if any, in the
absence of an inducing agent.

The term "inducing agent" is used to refer to a chemical, biological or physical
agent that effects translation from an inducible translational regulatory element. In
response to exposure to an inducing agent, translation from the element generally is
initiated de novo or is increased above a basal or constitutive level of expression.
Such induction can be identified using the methods disclosed herein, including
detecting an increased level of a reporter polypeptide encoded by the expressible
polynucleotide that is operatively linked to the translational regulatory element. An
inducing agent can be, for example, a stress condition to which a cell is exposed, for
example, a heat or cold shock, a toxic agent such as a heavy metal ion, or a lack of a
nutrient, hormone, growth factor, or the like; or can be exposure to a molecule that
affects the growth or differentiation state of a cell such as a hormone or a growth
factor. By examining the translational regulatory activity of a translational regulatory
element of the invention or a combination of such elements in cells exposed to such
conditions, or cells of a particular cell type, translational regulatory elements that are
inducible or tissue specific can be identified. As such, a recombinant nucleic acid
molecule of the invention can contain a tissue specific or inducible translational
regulatory element.

An expressible polynucleotide can be any polynucleotide that is operatively
linked to a translational regulatory element of the invention and encodes a polypeptide.
As used herein, the term "operatively linked" means that a regulatory element, which
can be a transcriptional or translational regulatory element, including a translational
regulatory element of the invention, is positioned with respect to a transcribable or
translatable nucleotide sequence such that the regulatory element can effect its
regulatory activity. A transcriptional enhancer, for example, can be located at any
distance, including adjacent to or up to thousands of nucleotides away from, and
upstream or downstream from the promoter and nucleotide sequence to be
transcribed, and still exert a detectable effect on the level of expression of an encoded
reporter molecule. In comparison, a translational regulatory element generally is
contained within about 1 to 500 nucleotides, particularly within about 1 to 100
nucleotides of a translation start site. In addition, the term "operatively linked" is
used with respect to a first and second polypeptide (or peptide) to refer to encoding
sequences that are linked in frame such that a fusion polypeptide can be produced.
Similarly, the term is used to refer to two or more cistrons of an expressible
polynucleotide that are transcribed as a single RNA molecule, which can contain, for
example, an IRES element of the invention in an intercistronic position.
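
By way of illustration only, the positional ranges stated above for a translational regulatory element (within about 1 to 500 nucleotides, particularly within about 1 to 100 nucleotides, of a translation start site) can be expressed as a simple distance check. The function name, the coordinate convention, and the example coordinates are assumptions made for this sketch; the check reflects only the stated ranges and does not determine whether an element is in fact operatively linked.

```python
def within_operative_range(element_pos: int, start_site_pos: int,
                           max_distance: int = 500) -> bool:
    """Return True if the element lies 1 to `max_distance` nucleotides from the start site.

    Positions are integer coordinates on the same transcript; the distance is
    taken as the absolute difference between the two coordinates.
    """
    distance = abs(start_site_pos - element_pos)
    return 1 <= distance <= max_distance


# Hypothetical coordinates, for illustration only.
print(within_operative_range(element_pos=120, start_site_pos=200))    # distance of 80
print(within_operative_range(120, 200, max_distance=100))             # narrower range
print(within_operative_range(element_pos=10, start_site_pos=900))     # distance of 890
```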

As used herein, the term "heterologous" is used in a comparative sense with
respect to a nucleotide sequence to indicate either that the nucleotide sequence is not an
endogenous nucleotide sequence in a cell into which it is to be introduced, or that the
nucleotide sequence is part of a construct such that it is in a form other than it normally
would be found in a cell. For example, a heterologous nucleotide sequence can be a
polynucleotide that encodes a mammalian viral polypeptide, where the heterologous
nucleotide sequence is to be introduced into a plant cell; i.e., the nucleotide sequence is
heterologous with respect to the plant cell. A heterologous nucleotide sequence also can
be, for example, a portion of a plant gene sequence that is operably linked to a plant
CRRE, provided that the construct is not normally found in a plant cell, for example, a
plant CRRE operably linked to a heterologous nucleotide sequence consisting of a
hypothetical plant gene exon 3, wherein the plant gene can be a circadian-regulated gene
comprising the plant CRRE to which the exon is operably linked. Thus, a heterologous
nucleotide sequence in a recombinant polynucleotide of the invention can be any
nucleotide sequence that is not normally part of the plant circadian-regulated gene
from which the oligonucleotide comprising the CRRE component of the recombinant
polynucleotide is obtained; or, if it is a part of the plant circadian-regulated gene from
which the CRRE is obtained, it is in an orientation other than it would normally be in the
gene, for example, is an antisense sequence, or comprises at least a partially
discontinuous sequence as compared to the genomic structure.

The present invention also relates to a kit, which contains a translational
regulatory element of the invention. In particular, a kit of the invention can contain a
plurality of synthetic translational regulatory elements, particularly a plurality of
different translational regulatory elements. The translational regulatory elements of the
plurality can be used individually for a specific purpose depending on the characteristics
of the element, for example, translational enhancing activity, IRES activity, or
translational inhibitory activity, or tissue specific expression, inducible expression, or
other characteristic. In addition, the translational regulatory elements of the plurality can
be used in combination to provide, for example, a high level of translational enhancing
or IRES activity.

The translational regulatory elements in a kit of the invention can contain one or
more flanking sequences, which, independently, can be at a 5' end or a 3' end or both
5' and 3' ends. Such flanking sequences can be, for example, restriction endonuclease
recognition sites or recombinase recognition sites such as a lox or att sequence, and can
be single stranded overhangs. As such, the flanking sequence can be used to facilitate
operatively linking two or more translational regulatory elements in the plurality to each
other or for linking the element to any other polynucleotide.

The present invention also relates to a method of producing a genetically
modified cell that exhibits altered expression of a polypeptide. The method can be
performed, for example, by introducing a translational regulatory element of the
invention into a cell, whereby the synthetic translational regulatory element is
operatively linked to a nucleotide sequence encoding a polypeptide. As disclosed
herein, the translational regulatory element can be operatively linked to the nucleotide
sequence, which can be an expressible polynucleotide, prior to introducing the
translational regulatory element into the cell. For example, the translational
regulatory element can have translational enhancing activity or IRES activity such
that the encoded polypeptide is expressed in the cell, preferably at a desired level.
The polypeptide can be any polypeptide, including, for example, a reporter
polypeptide, a toxin, or a therapeutic agent.

As disclosed herein, the nucleotide sequence to which the translational
regulatory element is operatively linked also can be a sequence of an endogenous
gene in the cell genome. Where it is desired, for example, to decrease the expression
of an endogenous polypeptide in a cell, for example, expression of an activated Ras
protein, the translational regulatory element is selected based on its having
translational inhibitory activity. The translational regulatory element can be targeted
to a particular locus, for example, upstream of the start codon of a mutant Ras gene
using a method such as homologous recombination. In addition, a translational
regulatory element having translational inhibitory activity also can be introduced into
a eukaryotic cell such that it can randomly integrate into the cell genome, thus
providing a novel means to obtain a "knock out" phenotype in a cell.

The production of a genetically modified cell expressing a heterologous
polypeptide, for example, is facilitated by including the nucleotide sequence of
interest as one cistron of a polycistronic expressible polynucleotide, to which one or
more translational regulatory elements of the invention, including one or more IRES
elements, is operatively linked. One or more other cistrons in the expressible
polynucleotide can encode, for example, a reporter molecule such that cells
containing the construct can be identified, selected and isolated. One or more other
cistrons in the construct also can encode a second polypeptide that affects, for
example, the level of transcription of the expressible polynucleotide in the genetically
modified cell, or the activity of the expressed heterologous polypeptide in the cell.

Accordingly, the present invention also relates to a genetically modified
eukaryotic cell produced by a method of the invention, as well as to a transgenic
non-human eukaryotic organism containing or derived from such a genetically
modified cell, a cell or tissue obtained from the transgenic non-human eukaryotic
organism, and a cDNA or genomic DNA library prepared from the transgenic
non-human eukaryotic organism or from a cell or tissue obtained from said transgenic
non-human eukaryotic organism. Such a genetically modified cell or transgenic
organism can be useful, for example, to produce a polypeptide of interest, or as a
model system for a pathologic condition, for example, a condition characterized in
part by overexpression or accumulation of a particular polypeptide.

The present invention further provides a method of altering translational activity
in a eukaryotic cell, the method comprising introducing into the cell a synthetic
translational regulatory element. As disclosed herein, introduction of a translational
regulatory element of the invention having translational enhancing activity can reduce
the level of translation in a cell. While no mechanism for this action is proposed herein
or, in fact, relevant to practicing a method of the invention, one possibility is that the
element can bind to and sequester trans-acting translational regulatory factors such as
eukaryotic initiation factors or the like, similar to effects seen with transcriptional
regulatory elements when introduced into cells. Thus, by introducing a translational
regulatory element of the invention having translational enhancing activity or IRES
activity into a eukaryotic cell, the translational activity in the eukaryotic cell can be
reduced or inhibited. Conversely, by introducing a translational regulatory element
having translational inhibitory activity into a eukaryotic cell, translational activity in the
cell is increased due, for example, to the sequestering of a trans-acting factor that
otherwise binds to an endogenous translational inhibitory sequence in the cell to inhibit
translation.

The present invention also provides a means to conveniently and efficiently
select specific cells and cell populations. Identification of such cells is effected by
introducing a construct containing a translational regulatory element, for example, an
IRES element operatively linked to a nucleotide sequence encoding a reporter
molecule such as a fluorescent protein or selectable marker. Where an IRES element
is used, the IRES element can drive expression of a monocistronic reporter cassette or
can drive expression of a second or other cistrons in a polycistronic cassette, one or
more of which encode a reporter polypeptide and one or more others of which can
encode any polypeptide or polypeptides of interest. Generally, in a polycistronic
reporter, an IRES element is contained in an intercistronic position, particularly
upstream of the cistron to be regulated by the IRES. Thus, desired cells can be
selected based on expression of the reporter polypeptide driven by the translational
regulatory element, and, if desired, the cell can be one that also expresses a
heterologous polypeptide of interest, which can be co-expressed from the same
construct as the reporter polypeptide.

Particularly useful co-expressed reporter polypeptides can include cell surface
polypeptides, which act as markers that permit immunocytometric or affinity
chromatographic selection and isolation of the cells expressing the protein of interest;
enzymes or other polypeptides that confer antibiotic resistance, which allow the use of
a growth selection method to identify, select and isolate cells co-expressing the
protein of interest; other enzymes that confer a unique reporting capability to a cell,
for example, alkaline phosphatase, horseradish peroxidase or any other enzyme
generally used as a histologic or diagnostic reporter molecule. Such polypeptides,
including a reporter polypeptide and co-expressed protein of interest, can be expressed
under in vivo or in vitro conditions and, similarly, the cells expressing the polypeptide
can be identified, selected, or isolated under in vivo or in vitro conditions.



The present invention also provides methods for improving protein yields by a
cell, which can be a cell in culture or a cell in an organism, through enhanced protein
stability, enhanced cell stability and viability, and the like. Stability of expressed
protein products can be improved using, for example, IRES mediated co-expression
of proteins or peptides such as chaperone proteins, heat shock proteins, proteins such
as superoxide dismutase and catalase, which decrease the levels of oxidizing agents in
a cell, protease inhibitors, phosphatase inhibitors, and the like, which can enhance the
stability of the protein of interest or a cell expressing the protein of interest, thereby
enhancing the yield of the protein obtainable from the cell. For example,
co-expression of protease inhibitors with a protein of interest can protect the protein
from cellular proteases, thus prolonging the time the protein is present in a cell and
increasing the yield obtainable from the cell. Similarly, enzyme inhibitors or
modulators such as caspase inhibitors can maintain cellular integrity during
expression of the protein of interest by prolonging the life of the cell. Antibiotic
peptides such as magainins, defensins, cryptdins and the like also can be co-expressed
with the protein of interest, thereby protecting the cell from microbial contamination,
thus increasing the likelihood that the cells can be maintained in a healthy and
productive state. Furthermore, by using concatemers of the disclosed translational
regulatory elements, including concatemers of the same or different elements, very
high levels of translation can be obtained and translation in particular cell types or at
particular stages of the cell cycle can be obtained. Such methods are particularly
valuable to the bioprocessing industry, which is involved in large scale manufacture
of proteins for medical and industrial use.

Improved protein yields also can be obtained by using an IRES element for
co-expressing a transcription factor specific for a promoter element contained in the
construct comprising the IRES element, or for co-expressing translational regulatory
factors such as eukaryotic translation initiation factors that are specific for the IRES
or other translational regulatory elements contained in the construct. Such methods
provide a means for enhancing cellular signaling in the environment of the cell
producing the target protein. IRES mediated co-expression of agents that affect the
cellular environment of the cell expressing a target protein can be particularly useful
for in vivo diagnostic and therapeutic applications, as well as for in vitro
bioproduction methods and as research tools. For example, the co-expressed agents
can include proteins or peptides from the classes of neuromodulators, peptide/protein
hormones, growth factors, adhesion proteins, cell media peptides, and the like.

The present invention also provides methods for producing complex proteins
or protein systems. For example, IRES-mediated co-expression of the subunits of a
multi-component protein has been reported (see, for example, U.S. Pat.
No. 6,060,273). However, unique applications using the presently disclosed
translational regulatory elements, including IRES elements, allow the co-expression
of enzymes or pro-enzymes with specific substrates, prosthetic groups, allosteric
modulators, processing enzymes, chaperones, and the like.

The following examples are intended to illustrate but not limit the invention. 

EXAMPLE 1

IDENTIFICATION OF A MODULAR IRES ELEMENT THAT
ENHANCES TRANSLATIONAL ACTIVITY

This example describes the identification and characterization of a nine
nucleotide IRES element from the 5' untranslated region (5' UTR) of the mRNA
encoding the Gtx homeodomain protein.

A. Identification of a Nine Nucleotide IRES Element in the Gtx 5' UTR

Dicistronic constructs were based on the pGL3-R2 (RP) and pGL3-R2h (RPh)
dicistronic reporter vectors (Chappell et al., Proc. Natl. Acad. Sci., USA 97:1536-
1541, 2000, each of which is incorporated herein by reference; see, also, Stoneley et
al., supra, 1998). These vectors encode a dicistronic mRNA, with the Renilla (sea
pansy) luciferase gene upstream of the Photinus (firefly) luciferase gene. An SV40
promoter and enhancer drive expression. The RPh vector contains a 60 bp inverted
repeat located 5' (upstream) of the first cistron. The 5' UTR of the Gtx mRNA (SEQ
ID NO: 1; Komuro et al., supra, 1993) was synthesized as two overlapping
oligonucleotides, amplified by PCR with Pfu DNA polymerase (Stratagene; La Jolla
CA), and inserted into the intercistronic region as an Eco RI to Nco I fragment
immediately upstream of the Photinus luciferase initiation codon in both the RP and
RPh vectors. Other inserts were similarly cloned into the intercistronic region using
the Eco RI and Nco I restriction sites, or using the Spe I and Eco RI restriction sites.
The sequences of all constructs were verified. Plasmids containing the
chloramphenicol acetyl transferase (CAT) gene instead of the Photinus luciferase
gene were derived by deleting the luciferase gene and replacing it with the CAT gene
using the Nco I and Bam HI restriction sites (RCh). The CAT gene was from the
pCAT3 control vector (Promega; Madison WI).

Cell lines were obtained from ATCC and maintained according to their
instructions. Dicistronic reporter constructs were transfected into cells using
FuGENE 6 reagent (Roche; Indianapolis IN), according to the manufacturer's
instructions. Transfection efficiencies were normalized by cotransfection with the
pCMVβ vector (Clontech; Palo Alto CA). Cells were harvested after 24 hr and
luciferase activity was determined using the dual reporter assay system (Promega).
β-galactosidase activity was assayed using the Fluoreporter lacZ kit (Molecular
Probes; Eugene OR) and fluorescence was measured using the Millipore Cytofluor
2450 system. CAT enzyme activity was measured using n-butyryl CoA according to
technical bulletin #84 (Promega).

Northern blot analysis was performed as described by Mauro and Edelman
(supra, 1997) using RNA prepared by the guanidinium thiocyanate method from the
neuronal cell line Neuro 2a (N2a; Chomczynski and Sacchi, Anal. Biochem.
162:156-159, 1987, which is incorporated by reference). Poly(A)+ RNA was selected
using Oligotex oligo(dT) beads (Qiagen; Chatsworth CA). Hybridizations were
performed using Rapid hyb buffer (Amersham; Piscataway NJ) with a probe
containing the entire Photinus luciferase coding sequence. This probe was obtained
by digesting the dicistronic construct at the Nco I and Sal I restriction sites.
Hybridizations were performed at 65°C; wash stringency was 65°C in 0.1X saline
sodium citrate (SSC).

The 5' UTR of the Gtx mRNA (SEQ ID NO: 1) is 196 nucleotides in length
(Komuro et al., supra, 1993; see, also, GenBank Accession No. L08074, ATG is at
nucleotides 197 to 199) and contains several complementary sequence matches to
18S rRNA, one of which was the focus of a previous study (Hu et al., supra, 1999).
In order to determine whether a sequence of the 5' UTR can interact directly with
complementary sequences in the 18S rRNA, the Gtx 5' UTR was inserted into the
intercistronic region of the dicistronic mRNA (Gtx1-196/RP), which contains the
Renilla luciferase coding sequence as the first cistron and the Photinus luciferase
coding sequence as the second cistron (Stoneley et al., supra, 1998). The presence of
the 196-nucleotide Gtx 5' UTR enhanced expression of the downstream cistron about
7.2-fold over background in the rat neuronal N2a cell line and about 8.8-fold in the rat
glioma C6 cell line. The IRES activity was approximately 47% of that of the
encephalomyocarditis virus (EMCV) IRES in N2a cells and 84% of that of EMCV in
C6 cells when tested in the same vector system (EMCV/RP). Translation of the
β-globin mRNA is cap-dependent (Keiper and Rhoads, Nucl. Acids Res. 25:395-402,
1997). As such, insertion of the 5' UTR of the β-globin mRNA in the intercistronic
region served as a control (no IRES activity; β-globin/RP).

In order to confirm that enhanced expression of the second cistron was not
dependent on translation of the first cistron, translation of the first cistron was
inhibited by introducing an inverted repeat, which has potential to form a stable
hairpin structure, in the 5' UTR (Gtx1-196/RPh). When normalized to the activity of
the RP vector, the presence of the hairpin structure resulted in increased IRES activity
up to 31-fold over background. Northern blot analysis using poly(A)+ RNA isolated
from N2a cells transfected with the Gtx1-196/RP vector and probed with a Photinus
luciferase fragment showed that the cells expressed only a single detectable mRNA of
the expected size. These results demonstrate that enhanced expression of the
downstream Photinus luciferase gene was not due to the generation of functional
monocistronic mRNAs due, for example, to an unusual splicing event or RNA
fragmentation.

In order to determine whether the Gtx 5' UTR contains shorter segments with
IRES activity, the 5' UTR was sequentially deleted from the 5' and 3' ends, and was
fragmented into 5 non-overlapping segments of approximately 40 nucleotides (see
Figure 1), each of which was tested for IRES activity. To minimize the contribution
of the first cistron, the RPh vector was used for these constructs. All of the deleted
and fragmented sequences examined had IRES activity, with the exception of
constructs Gtx120-196/RPh and Gtx167-196/RPh. Deletion of nucleotides 167 to 196 of
SEQ ID NO: 1 increased IRES activity to a level greater than that of the entire 5' UTR
(Gtx1-166/RPh and Gtx120-166/RPh). These results indicate that either the 167 to 196
region of SEQ ID NO: 1 inhibits IRES activity, or the IRES activity exhibited by
nucleotides 1 to 166 of SEQ ID NO: 1 is dependent on its location relative to the
initiation codon.

The nucleotide sequence 133 to 141 (CCGGCGGGT; SEQ ID NO: 2), which
is contained within the Gtx120-166/RPh construct, is 100% complementary to
nucleotides 1132 to 1124 of 18S rRNA, and has been shown to crosslink to its
complement within intact 40S subunits (Hu et al., supra, 1999). The 9 nucleotide Gtx
sequence (SEQ ID NO: 2) was directly tested for IRES activity. The spacing of the
9 nucleotide Gtx sequence relative to the initiation codon was maintained as in the
Gtx120-166/RPh construct by using a polynucleotide sequence identical to the whole
β-globin 5' UTR (SEQ ID NO: 3), which lacks detectable IRES activity. IRES
activity associated with this construct (see SEQ ID NO: 4) was about 3.4-fold over the
background. These results demonstrate that a core 9 nucleotide sequence of the Gtx
5' UTR (SEQ ID NO: 2) has IRES activity. As such, this sequence is referred to as
the Gtx IRES module (SEQ ID NO: 2).
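
The complementarity described above can be illustrated with a minimal sketch. It assumes strict Watson-Crick pairing only (G-U wobble pairs are not considered), and the 18S rRNA segment it prints is derived from SEQ ID NO: 2 rather than taken from a sequence database; the function names are illustrative and not part of the disclosure.

```python
# Minimal sketch: derive the RNA segment that is perfectly complementary
# to the 9 nucleotide Gtx IRES module (SEQ ID NO: 2).
# Assumption: strict Watson-Crick pairing; wobble pairs are ignored.

COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def to_rna(seq):
    """Return the sequence in upper-case RNA letters (T replaced by U)."""
    return seq.upper().replace("T", "U")


def reverse_complement(seq):
    """Return the reverse complement of seq, written 5' to 3' as RNA."""
    return "".join(COMPLEMENT[base] for base in reversed(to_rna(seq)))


def is_perfect_complement(segment, target):
    """True if target pairs with every nucleotide of segment (antiparallel)."""
    return reverse_complement(segment) == to_rna(target)


GTX_MODULE = "CCGGCGGGT"  # SEQ ID NO: 2 as written in the text

if __name__ == "__main__":
    # Segment of rRNA (5' to 3') that would pair with the module.
    print(reverse_complement(GTX_MODULE))
```

A candidate mRNA segment could be screened the same way by passing it, together with an rRNA segment of equal length, to `is_perfect_complement`.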

B. Operatively Linked IRES Modules Enhance Translation Synergistically

Dicistronic constructs were generated containing 1 (SEQ ID NO: 5), 5 (SEQ
ID NO: 6), or 10 (SEQ ID NO: 7) copies of the Gtx IRES module (SEQ ID NO: 2).
The IRES modules were positioned beginning 25 nucleotides upstream of the
initiation codon using β-globin 5' UTR sequences (see SEQ ID NOS: 5 to 7).
Constructs with 5 or 10 copies of the IRES module were arbitrarily spaced
9 nucleotides from each other using a repeated segment of the β-globin 5' UTR
(designated SI; see, for example, SEQ ID NO: 6). In N2a cells, IRES activity of the
Gtx133-141/RPh construct, which contained a single copy of the IRES module (SEQ ID
NO: 5), was 2.5-fold over background; IRES activity of the (Gtx133-141)5(SI)5β/RPh
construct (5 copies; SEQ ID NO: 6) was 164-fold over background; and IRES activity
of the (Gtx133-141)10(SI)9β/RPh construct (10 copies; SEQ ID NO: 7) was 570-fold over
background (IRES activities were normalized to activity of the parent RPh vector). In
comparison, when normalized to the activity of the RP vector, IRES activity of the
(Gtx133-141)10(SI)9β/RPh construct was about 6,000-fold over background.
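
The synergy implied by these values can be made explicit with a short calculation. The sketch below assumes, purely for illustration, that a non-synergistic (additive) expectation for n copies is n times the activity of a single copy; that baseline is an assumption of this sketch and is not stated in the disclosure. The fold values are those reported above for N2a cells.

```python
# Minimal sketch: compare reported IRES activity of multi-copy constructs
# with a simple additive expectation.
# Assumption (illustrative only): additive expectation = copies * single-copy fold.


def additive_expectation(single_copy_fold, copies):
    """Fold activity expected if each copy contributed independently."""
    return single_copy_fold * copies


def synergy_factor(observed_fold, single_copy_fold, copies):
    """Ratio of observed fold activity to the additive expectation."""
    return observed_fold / additive_expectation(single_copy_fold, copies)


# Fold over background (normalized to the parent RPh vector), from the text.
REPORTED_FOLD = {1: 2.5, 5: 164.0, 10: 570.0}

if __name__ == "__main__":
    for copies, fold in REPORTED_FOLD.items():
        factor = synergy_factor(fold, REPORTED_FOLD[1], copies)
        print(f"{copies} copies: {factor:.1f}x the additive expectation")
```

Under this assumption, a factor well above 1 for the 5 and 10 copy constructs is what the heading of this section describes as synergistic enhancement.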

The ratio of Photinus luciferase activity to Renilla luciferase activity (P:R
ratio) reflects the absolute increase in Photinus luciferase activity for all constructs
except those that contain 5 or 10 copies of the Gtx133-141 sequence with the SI spacer,
in which the expression of both cistrons increased. The level of Renilla luciferase
expression obtained with the (Gtx133-141)10(SI)9β/RPh construct was approximately
20-fold higher than that of construct RPh. Northern blot analysis of mRNAs from
N2a cells indicated that the mRNA levels were not significantly different. Thus, it is
unlikely that increased mRNA levels were responsible for the increased expression of
the Renilla gene. A similar enhancement of both cistrons was observed when the
tobacco necrosis virus enhancer was placed in the 3' UTR of a dicistronic mRNA
(Meulewaeter et al., RNA 4:1347-1356, 1998).

Two other spacer sequences and a number of controls were examined to
distinguish between the effects of the 9 nucleotide IRES module (SEQ ID NO: 2) and
the spacer sequence. One spacer sequence included a different 9 nucleotide segment
of the β-globin 5' UTR (designated SII; see SEQ ID NOS: 8 and 9) and the other
included a poly(A)9 sequence (designated SIII; see SEQ ID NOS: 10 and 11). IRES
activity increased up to 243-fold over background for constructs containing the SII
spacer and up to 31-fold for constructs containing the SIII spacer. In all cases,
increasing the number of copies of the IRES module resulted in increased IRES
activity, although the level of activity was affected differently by the different spacer
sequences. The different activities due to the spacer sequences may be due to an
effect on the higher order structure of the mRNA, which may alter the presentation of
the IRES modules to the translation machinery. Alternatively, some spacer sequences
may introduce nucleotides that contribute to the IRES module, or that may themselves
synergize with the IRES modules. However, control experiments using three
different combinations of the spacer sequences alone - (SII/SIII)5 and (SII/SIII),
alternating in position as they do in the experimental constructs - revealed that the
spacer sequences did not have any detectable IRES activity (but see Example 1C,
below).

To confirm that the IRES activity observed with the (Gtx133-141)10(SI)9β/RPh
construct was independent of the reporter gene, the Photinus luciferase gene was
replaced with the CAT gene to produce the (Gtx133-141)10(SI)9β/RCh construct.
Following transfection into N2a cells, CAT activity was determined by thin layer
chromatography and liquid scintillation counting. CAT activity in lysates of cells
transfected with this synthetic IRES construct was approximately 103-fold greater
than that obtained for a construct containing the EMCV IRES linked to the CAT
reporter gene (EMCV/RCh); the activity of a control construct (RCh) was
indistinguishable from background. As for the RPh vector, the (Gtx133-141)10(SI)9β
synthetic IRES also increased Renilla expression significantly (about 6.1-fold)
compared to the control RCh construct. The overall ratio of CAT expression to
Renilla luciferase activity for the synthetic IRES (Gtx133-141)10(SI)9β/RCh was about
10.5-fold greater than that of the EMCV IRES. This is a high level of IRES activity,
but is about 6-fold lower than that obtained with constructs containing the Photinus
luciferase gene as the second cistron. Thus, the reporter cistron may have an
influence on IRES activity, as was observed with the EMCV IRES (Kaminski and
Jackson, RNA 4:626-638, 1998).

The activity of some viral IRESes can vary, depending on cell type (Borman et
al., supra, 1997). Accordingly, the activity of the synthetic (Gtx133-141)10(SI)9β/RCh
IRES was examined in 11 different cell lines, including rat C6 glial cells, mouse N2a
neuroblastoma cells, human SK-N-SH (SK) neuroblastoma cells, rat B104-1-1 (B104)
neuroblastoma cells, mouse NIH-3T3 fibroblasts, human HeLa cervical carcinoma
cells, mouse P19 embryonal carcinoma cells, rat NRK normal kidney cells, human
K562 chronic myelogenous leukemia cells, mouse AtT-20 pituitary tumor cells, and
mouse C2C12 muscle myoblast cells, and compared to that of the EMCV IRES. In
all of the cell lines tested, the synthetic IRES was about 33-fold (NRK cells) to
63-fold (N2a cells) more active than the EMCV IRES.

To investigate the potential effects of distance between IRES modules, a series
of constructs was generated containing 1 (SEQ ID NO: 12) or 2 (SEQ ID NOS: 13
to 15) copies of the Gtx IRES module (SEQ ID NO: 2). Spacing in constructs
containing 2 copies of the Gtx IRES module varied from 9 to 54 nucleotides (see SEQ
ID NOS: 13 to 15) using multiple copies of the SI spacer sequence, such that all of
the constructs were identical in size. The IRES activity obtained with 1 copy of the IRES
module increased 2.8-fold when a second copy was introduced 9 nucleotides upstream
of the first module. This activity decreased to about 2.1-fold and 1.3-fold as the
distance between the IRES modules was increased to 27 and 54 nucleotides,
respectively. Further experiments revealed that IRES activity continued to increase as
the number of multimers was increased to 35, and that multimers of up to 50 Gtx
modules (SEQ ID NO: 2), which was the most examined, had IRES activity, though
somewhat less than that obtained using 35 copies. These results demonstrate that at
least two and at least up to 50 of the Gtx IRES modules (SEQ ID NO: 2) can be
operatively linked, and that the IRES activity of the combined modules can be varied
by varying the distance between the IRES modules.
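
The layout of the two-module spacing series can be sketched as follows. This is an illustration under stated assumptions rather than a reproduction of SEQ ID NOS: 13 to 15: the SI spacer sequence is not given in this passage, so a placeholder of nine N residues is used, and the total of six spacer copies per construct is assumed so that the 9, 27 and 54 nucleotide spacings all yield constructs of equal length.

```python
# Minimal sketch: build two-module constructs of identical length in which
# only the distance between the two Gtx IRES modules varies.
# Assumptions (hypothetical): SPACER is a placeholder for the 9 nucleotide
# SI spacer, and each construct carries six spacer copies in total.

MODULE = "CCGGCGGGT"   # Gtx IRES module, SEQ ID NO: 2
SPACER = "N" * 9       # hypothetical stand-in for the SI spacer
TOTAL_SPACERS = 6      # assumed, so that 54 nt is the largest spacing


def two_module_construct(spacers_between, total_spacers=TOTAL_SPACERS):
    """Place the second module upstream of the first, separated by
    spacers_between spacer copies; unused copies pad the 5' end."""
    if not 1 <= spacers_between <= total_spacers:
        raise ValueError("spacers_between must be between 1 and total_spacers")
    upstream_padding = SPACER * (total_spacers - spacers_between)
    between = SPACER * spacers_between
    return upstream_padding + MODULE + between + MODULE


def module_spacing(construct):
    """Number of nucleotides separating the two module copies."""
    first = construct.find(MODULE)
    second = construct.find(MODULE, first + len(MODULE))
    return second - (first + len(MODULE))


if __name__ == "__main__":
    for copies in (1, 3, 6):
        construct = two_module_construct(copies)
        print(module_spacing(construct), len(construct))
```

Keeping the downstream module at a fixed 3' position mirrors the statement that the second copy was introduced upstream of the first module, so that only the inter-module distance changes between constructs.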
C. IRES Activity of Mutant Gtx Modules

Constructs containing 5 copies of the Gtx IRES module (SEQ ID NO: 2), each
separated by the 9 nucleotide SI β-globin leader sequence, or 5 copies of a mutated
version of the Gtx IRES module were cloned into the RPh vector, transfected into
N2a cells, and examined for luciferase activity. Portions of these constructs are
shown in Figure 2 (compare SEQ ID NO: 6, which contains the wild type Gtx IRES
module (SEQ ID NO: 2), with SEQ ID NOS: 17 to 19, which contain substitutions in
the module; see, also, SEQ ID NOS: 20 to 22, which contain substitutions in the
spacer sequence).

Substitution of adenosine residues at the 3' end of SEQ ID NO: 2 increased the
IRES activity of the oligonucleotide (compare SEQ ID NO: 6 with SEQ ID NOS: 17
and 19). Changes in the composition of the spacer nucleotides adjacent to the Gtx
IRES module either had no effect or also increased IRES activity (compare SEQ ID
NO: 6 with SEQ ID NOS: 20 to 22). These results indicate that, when present in a
particular context, the Gtx IRES module (SEQ ID NO: 2) provides a defined level of
IRES activity, and that additional sequences based on the Gtx IRES module, as well
as the context of the module with respect to flanking sequences, can provide different
levels of IRES activity.

EXAMPLE 2 

IDENTIFICATION AND CHARACTERIZATION OF IRES ELEMENTS
FROM DENDRITICALLY LOCALIZED RNA MOLECULES

This example demonstrates that RNA molecules that localize in dendrites contain
5' UTR nucleotide sequences that act as internal ribosome entry sites (IRESes).

A. Vector and Cell Lines

The 5' untranslated regions (5' UTRs) from the alpha subunit of Cam Kinase II
(CamK IIa; SEQ ID NO: 23), dendrin (SEQ ID NO: 24), the activity-regulated
cytoskeletal protein (ARC; SEQ ID NO: 25), microtubule-associated protein 2
(MAP2; SEQ ID NO: 26) and neurogranin (RC3; SEQ ID NO: 27) were obtained by
RT-PCR amplification of rat adult hippocampal cDNA. Amplified 5' UTRs
corresponded to the transcriptional start sites of the mRNAs and ended immediately
5' to the AUG translation start site. Primers were engineered with restriction
endonuclease recognition sites for Eco RI and Nco I, to allow for directional cloning
into the dicistronic vectors, RP and RPh, each of which encodes Renilla luciferase in
the first cistron and Photinus luciferase in the second cistron (see Example 1A). All
5' UTRs were verified by sequence analysis.

The monocistronic luciferase vector, PM, was generated by substituting the
Photinus luciferase gene from the RP construct for the EYFP (enhanced yellow
fluorescent protein) open reading frame in the pEYFP-N1 vector, which contains a
CMV promoter (Clontech), using the Eco RI and Xba I sites. As an internal control
for transfection efficiency, the region of the chloramphenicol acetyl transferase (CAT)
reporter gene construct of the pCAT3 control vector, which contains the SV40
promoter, CAT reporter gene and polyadenylation sequences (Promega), was also
cloned into the monocistronic vector. The inverted repeat encoding the hairpin
structure at the 5' end of RPh was amplified by PCR and inserted into the 5' region of
the multiple cloning site in PM to yield the monocistronic hairpin construct PMh.

Monocistronic luciferase constructs were generated from the EYFP-N1
(Clontech) vector by replacing the gene encoding enhanced yellow fluorescent protein
(EYFP) with the Photinus luciferase gene from the RP constructs using the Eco RI
and Xba I restriction sites. The chloramphenicol acetyl transferase (CAT) gene was
used as a control for transfection efficiency. A blunt ended Bam HI/Bgl II fragment
containing the SV40 promoter, the CAT reporter gene, and polyadenylation sequences
from the pCAT3 control vector (Promega) was cloned into the Ase I restriction site of
these constructs after filling in the 3' recessed ends using the Klenow fragment of
DNA polymerase I.

An EYFP construct (EYFP/NLS) was used to test the ability of the RC3
3' UTR to target mRNA to dendrites. This construct contains a nuclear localization
signal (NLS) and a nuclear matrix binding site (NMBS) attached to the EYFP coding
sequence to limit diffusion of EYFP from the cell body to the dendrites.

For in situ analysis of IRES activity, a dicistronic fluorophore vector was
constructed that encodes enhanced cyan fluorescent protein (ECFP) as the first cistron
and EYFP as the second cistron. Both fluorescent cistrons were modified to contain
the mouse ornithine decarboxylase (ODC) destabilizing motif, which shortened the
half-lives of these proteins to approximately 4 hr. This motif was obtained from the
EGFP-N1 vector (Clontech) as a Bsr GI/Not I fragment. The mRNA expressed from
this construct was targeted to dendrites using the 3' UTR from the RC3 gene.

Rat neural tumor B104 cells, mouse neuroblastoma N2a cells, rat glial tumor
C6 cells, mouse NIH 3T3 (3T3) fibroblasts, and mouse LMTK fibroblast cell lines
were used for these studies. Cells were prepared and transiently transfected as
previously described (Chappell et al., supra, 2000). Primary cultures of dissociated
hippocampal neurons were cultured on poly-L-lysine coated chamber slides (Sporns
and Jenkinson, Neuroscience 90:1057-1073, 1997, which is incorporated herein by
reference). Culture media were exchanged twice weekly until neurons had reached
maturation (about 21 to 25 days in vitro). Hippocampal neurons were transfected
using calcium phosphate (Promega).

B. Reporter Gene Assays

Cells were prepared 12 to 14 hr prior to transfection by plating approximately
70,000 to 80,000 cells per well in a Falcon 6-well culture cluster. Cells were grown
in DMEM with 10% FBS supplemented with 1% penicillin, streptomycin and
glutamine. Dicistronic luciferase vectors (0.5 µg) were co-transfected with 0.2 µg of
β-galactosidase reporter vector (CMVβ, Promega) per well using FuGENE 6
transfection reagent according to manufacturer's instructions (Boehringer; Germany).
Transfected cells were grown for 22 to 24 hr, then rinsed briefly with PBS and lysed
using 250 µl lysis buffer (Promega). Extracts (20 µl) were assayed for Renilla and
Photinus luciferase activity using the dual luciferase assay kit (Promega).
β-galactosidase was measured simultaneously using the fluorescent reporter β-gal
assay kit (Molecular Probes; Eugene OR). All constructs were transfected at least
three times and individual transfections were performed in triplicate.

Monocistronic vector transfections were performed as above, but were not
cotransfected with pCMVβ vector. B104 cells were transfected with monocistronic
DNA constructs, then lysed and assayed for Photinus luciferase and CAT enzyme
activity using single luciferase assay reagents (Promega) and a CAT enzyme assay kit
(Promega) according to the manufacturer's instructions. Quantification of CAT
activity was performed using a liquid scintillation counter. In experiments using the
4E-BP1 mutant (see below), either a plasmid expressing the dominant negative form
of the eIF4E binding protein 4E-BP1, or an empty expression vector was
co-transfected in parallel with the monocistronic constructs (Gingras et al., Genes Devel.
13:1422-1437, 1999, which is incorporated herein by reference).

Total RNA from transfected cell lines was extracted using TRIZOL reagent
(Gibco BRL; Gaithersburg MD) and was size separated on a 1.3% agarose/
formaldehyde gel. RNA was transferred to nylon membrane using capillary transfer,
baked for 1 hr at 80°C, and prehybridized in 50% formamide, 5X SSC, 50 mM
NaHPO4, 1% SDS, 5% dextran sulfate, and 200 µg/ml polyadenylate for 2 hr
at 70°C.

cDNA encoding the full-length Photinus luciferase reporter enzyme was
directionally subcloned from the pGL3 (Promega) vector into the pBS SK II (+)
vector (Stratagene; La Jolla CA) using Hind III and Xba I restriction sites. The
resulting plasmid was linearized with Hind III and used as a template for synthesis by
in vitro transcription of ^^P-radiolabeled and digoxygenin (DIG)-labeled cRNA
probes for northern blot analysis and in situ hybridization (Bizon et al., J. Comp.
Neurol. 408:283-298, 1999; Pinkstaff et al., Brain Res. Mol. Brain Res. 55:265-276,
1998, each of which is incorporated herein by reference).

For fluorescent microscopic analysis, transfected neurons were fixed using
4% paraformaldehyde in phosphate buffered solution, placed under cover slips using
Slow-Fade reagent (Molecular Probes), and visualized and quantified as described
previously (Vanderklish et al., Proc. Natl. Acad. Sci., USA 97:2253-2258, 2000,
which is incorporated herein by reference). Images were captured from each
fluorescent filter channel for equal time periods and optimized so that the emission
from both fluorophores was within a linear range. At least 20 images were quantified
for each construct. Experimental data were analyzed for statistical significance using
one-way analysis of variance (ANOVA) followed by an individual analysis of
experimental versus control data points using a two-tailed Student's t-test. Results
were considered significant at p < 0.05.
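
The two test statistics named above can be sketched with the standard library alone. This is a generic illustration of a one-way ANOVA F statistic and a pooled-variance (Student's) t statistic, not the software actually used for the analysis, which the text does not identify; the function names are illustrative. Converting either statistic to a p value requires the F or t distribution, which a statistics package such as SciPy (`scipy.stats.f_oneway`, `scipy.stats.ttest_ind`) provides.

```python
# Minimal sketch: one-way ANOVA F statistic followed by a pooled-variance
# Student's t statistic for one experimental group versus control.
# The p value lookup is intentionally left out (needs F and t distributions).

from math import sqrt
from statistics import mean


def one_way_anova_f(groups):
    """F = (between-group mean square) / (within-group mean square)."""
    values = [v for group in groups for v in group]
    grand_mean = mean(values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def student_t(sample, control):
    """Two-sample t statistic assuming equal variances (pooled)."""
    n1, n2 = len(sample), len(control)
    m1, m2 = mean(sample), mean(control)
    ss1 = sum((v - m1) ** 2 for v in sample)
    ss2 = sum((v - m2) ** 2 for v in control)
    pooled_variance = (ss1 + ss2) / (n1 + n2 - 2)
    return (m1 - m2) / sqrt(pooled_variance * (1 / n1 + 1 / n2))


if __name__ == "__main__":
    # Hypothetical fluorescence ratios for a control and two constructs.
    control, construct_a, construct_b = [1, 2, 3], [2, 3, 4], [5, 6, 7]
    print(one_way_anova_f([control, construct_a, construct_b]))
    print(student_t(construct_b, control))
```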

D. Identification of Translational Regulatory Activity in the 5' UTRs of
Dendritically Localized mRNAs

To determine the effect on translation of the full length leader sequences of
CamK IIa (SEQ ID NO: 23), dendrin (SEQ ID NO: 24), ARC (SEQ ID NO: 25),
MAP2 (SEQ ID NO: 26), and RC3 (SEQ ID NO: 27), the sequences were inserted
into the 5' UTR of a monocistronic reporter mRNA (PM), which encodes the
Photinus luciferase protein, and examined for luciferase activity. The translation of
β-globin mRNA is cap-dependent and the 5' UTR of the β-globin mRNA was
included in these studies as a control. Monocistronic constructs were transfected into
B104 cells. RNA blot analyses of total RNA from transfected B104 cells showed
similar intensity of the autoradiographic bands between monocistronic constructs,
indicating that the leader sequences do not affect the transcription or the stability of
the monocistronic mRNAs.



The Photinus luciferase activity of each construct was determined, and the
results were normalized for transfection efficiency using CAT activity, which was
co-expressed from the PM plasmids. The β-globin 5' UTR had little effect on
monocistronic translation. However, mRNA containing the MAP2 (SEQ ID NO: 26),
ARC (SEQ ID NO: 25), or CamK IIa (SEQ ID NO: 23) 5' UTR showed enhanced
levels of translation compared to the base vector by up to 2.5-fold. In contrast,
monocistronic mRNA containing the dendrin 5' UTR (SEQ ID NO: 24) generated
luciferase activity that was approximately 20% less than control values. These results
indicate that the 5' UTRs (SEQ ID NOS: 23 to 26) can alter the overall levels of
translation of monocistronic mRNAs, and do not alter the stability of the
monocistronic mRNAs, which could result in corresponding alterations in luciferase
activity.

To determine whether the cap-independent translation occurred by internal
initiation, the 5' UTRs from CamK IIa (SEQ ID NO: 23), dendrin (SEQ ID NO: 24),
ARC (SEQ ID NO: 25), and MAP2 (SEQ ID NO: 26) were cloned into the
intercistronic region of the dicistronic luciferase vector, RP. Constructs were
transfected into the neural cell lines (N2a cells, B104 cells, and C6 cells) and assayed
for Renilla and Photinus luciferase activity.

Little or no translation of the second cistron (Photinus luciferase) occurs in the
base vector (RP) and, consequently, only the first cistron (Renilla luciferase) was
translated. However, if a sequence that can initiate translation is inserted between the
two cistrons, then both the Renilla and Photinus luciferase cistrons will be translated.
The ability of sequences within the intercistronic region to initiate translation of the
second cistron is characteristically defined as IRES activity. The results are
calculated as a ratio of Photinus to Renilla luciferase activity and compared to the
base vector RP.
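
The calculation described above can be sketched as follows. The luminometer readings in the example are hypothetical values chosen only to show the arithmetic; the function names are illustrative and do not come from the disclosure.

```python
# Minimal sketch: IRES activity as the Photinus:Renilla (P:R) ratio of a
# test construct divided by the P:R ratio of the base vector RP.
# The readings below are hypothetical.


def pr_ratio(photinus, renilla):
    """Second cistron (Photinus) signal relative to first cistron (Renilla)."""
    if renilla <= 0:
        raise ValueError("Renilla activity must be positive")
    return photinus / renilla


def ires_activity(construct_readings, base_vector_readings):
    """Fold increase in P:R ratio of a construct over the base vector."""
    return pr_ratio(*construct_readings) / pr_ratio(*base_vector_readings)


if __name__ == "__main__":
    base_vector = (50.0, 10000.0)       # hypothetical (Photinus, Renilla) for RP
    test_construct = (650.0, 10000.0)   # hypothetical reading with an insert
    print(round(ires_activity(test_construct, base_vector), 1))
```

Dividing by the Renilla signal corrects for differences in transfection and mRNA level between wells, so that the fold value reflects translation of the second cistron specifically.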



Insertion of the 5' UTR of β-globin, translation of which is cap-dependent,
resulted in luciferase activity that was comparable to that of the RP vector alone. In
contrast, insertion of the EMCV IRES into the intercistronic region resulted in about a
6 to 7-fold increase in translational activity as compared to RP. Insertion of the
5' UTRs of the dendritically localized mRNAs (SEQ ID NOS: 23 to 26) into the
intercistronic region also resulted in increased translational activity, particularly with
the ARC 5' UTR (SEQ ID NO: 25), which exhibited about 13-fold greater activity
than the RP vector alone, and about 2-fold greater than that observed with the EMCV
IRES in C6 cells. The MAP2 (SEQ ID NO: 26), CamK IIa (SEQ ID NO: 23), and
dendrin (SEQ ID NO: 24) 5' UTRs generated approximately 1/3 to 1/2 the translation
activity of EMCV, except that the IRES activity generated by the MAP2 5' UTR
(SEQ ID NO: 26) was nearly equivalent to the activity of the EMCV IRES in
C6 cells.

The relative levels of IRES activity between the different 5' UTRs generally
were similar in all three neural cell lines. However, transfection of C6 cells
consistently demonstrated higher ratios than N2a cells or B104 cells for all constructs.
These results demonstrate that the 5' UTRs shown as SEQ ID NOS: 23 to 26, which
are from dendritically localized mRNAs, contain sequences sufficient for the internal
initiation of translation.

E. Internal Initiation in Non-Neural Cell Lines

To determine whether the 5' UTRs could also function as IRESes in
non-neural cell types, the dicistronic constructs were transfected into 3T3 (fibroblast)
cells and LMTK (kidney) cells. Each of the dicistronic constructs containing the
5' UTRs from the dendritically localized CamK IIa (SEQ ID NO: 23), dendrin (SEQ
ID NO: 24), ARC (SEQ ID NO: 25), and MAP2 (SEQ ID NO: 26) mRNAs was
expressed significantly above control values, demonstrating that IRES activity was
conferred by these sequences in both neural and non-neural cell lines. Furthermore,
the relative IRES activity generated from the dicistronic mRNAs in non-neural cells
was similar to that observed in neural cell lines, with the ARC 5' UTR (SEQ ID
NO: 25) being greater than the EMCV IRES, and the MAP2 5' UTR (SEQ ID
NO: 26) being consistently greater than CamK IIa (SEQ ID NO: 23) and dendrin
(SEQ ID NO: 24). However, the IRES activity in non-neural cells for dendrin, ARC,
and MAP2 5' UTRs was higher than that observed for any of the neural cell lines.
These results demonstrate that the 5' UTRs of these dendritically localized mRNAs
can internally initiate translation in neural and non-neural cell lines, and that the
overall level of IRES activity varies among cell types.

F. Dicistronic mRNAs Are Not Alternatively Spliced

To determine if translation of the second cistron was due to alternative
splicing of the dicistronic mRNAs resulting in the generation of two monocistronic
messages, a northern blot analysis of total RNA from N2a cells transfected with the
CamK IIa (SEQ ID NO: 23), dendrin (SEQ ID NO: 24), ARC (SEQ ID NO: 25), and
MAP2 (SEQ ID NO: 26) dicistronic constructs was performed. Equivalent amounts
of total RNA were hybridized with a cRNA probe derived from the full-length
Photinus luciferase open reading frame. Using this probe, any message that is spliced
is detected as a lower molecular weight fragment. However, no apparent lower
molecular weight fragments were present, and the hybridized cRNA probe detected
bands of the correct size for all dicistronic messages. This result demonstrates that
the dicistronic constructs are translated as a single RNA species.

G. Translation of the Second Cistron is Not Due to Re-Initiation or
Read-Through

In addition to internal initiation, translation of the second cistron also can be
due to re-initiation or read-through from the first cistron. To eliminate this
possibility, the CamK IIa (SEQ ID NO: 23), dendrin (SEQ ID NO: 24), ARC (SEQ
ID NO: 25), and MAP2 (SEQ ID NO: 26) 5' UTRs were cloned into the RPh vector,
which contains an inverted repeat sequence at the 5' end of the first cistron. The
inverted repeat forms a stable hairpin loop structure and inhibits cap-dependent
translation of the first cistron, dramatically reducing the levels of Renilla luciferase
activity.

The dicistronic constructs were transfected into B104 cells and assayed for
luciferase activity. Luciferase activity ratios for each of the four 5' UTRs were
significantly greater than background RPh ratios, demonstrating that the 5' UTRs
direct internal initiation of translation of the second cistron even when translation of
the first cistron is inhibited. The ratio of Photinus to Renilla for the 5' UTRs from the
dendritically localized mRNAs was elevated as compared to that of sequences lacking
the hairpin structure, indicating that inhibition of cap-dependent translation can
enhance cap-independent translation initiated by the 5' UTRs (SEQ ID NOS: 23
to 26). In comparison, the 5' UTR of β-globin was not significantly different from
RPh luciferase activity, and the EMCV IRES activity was maintained in the presence
of the hairpin. These results indicate that translation of the second cistron of
dicistronic mRNAs is due to internal initiation, and not read-through or re-initiation
from the first cistron, thus confirming that the 5' UTRs (SEQ ID NOS: 23 to 26) of the
dendritically localized mRNA molecules examined act as IRES elements.

H. 5' UTRs Confer Cap-Dependent and Cap-Independent Translation

To examine whether the CamK IIa, dendrin, ARC, and MAP2 5' UTRs (SEQ
ID NOS: 23 to 26, respectively) function through both cap-dependent and
cap-independent mechanisms of translation in monocistronic mRNAs, a hairpin was
constructed at the 5' end of the monocistronic vector PM. This base vector (PMh) was
used to examine the ability of the 5' UTRs to direct internal initiation of translation
without the influence of ribosomal cap binding and scanning. The PMh-based
constructs were transfected into B104 cells and assayed for both Photinus and CAT
enzymatic activity.

The construct containing the β-globin 5' UTR exhibited luciferase activity that
was slightly below that of the base vector, demonstrating that, as shown previously,
the β-globin 5' UTR does not initiate translation internally. All four UTRs from the
dendritically localized mRNAs (SEQ ID NOS: 23 to 26) generated luciferase activity
above that produced by the base vector. As seen with the other constructs, the ARC
5' UTR (SEQ ID NO: 25) generated the most robust activity. However, the relative
levels of luciferase activity produced by these 5' UTRs differed from those seen in
monocistronics without hairpins. In particular, the dendrin 5' UTR (SEQ ID NO: 24),
which demonstrated the least amount of IRES activity compared to the other 5' UTRs
and actually decreased the level of luciferase activity in monocistronic RNA,
produced a 3-fold increase in IRES activity in the hairpin-containing monocistronic
RNA. Also, the MAP2 5' UTR (SEQ ID NO: 26) was not as robust when placed
downstream of the hairpin as it was in the monocistronic mRNA. These results
indicate that all four of the 5' UTRs (SEQ ID NOS: 23 to 26) utilize internal initiation
of translation to different extents in the monocistronic RNAs.

Several mechanisms can regulate cap-dependent eukaryotic translation,
including the modulation of eIF4E binding to the cap structure. To address whether
these 5' UTRs could initiate translation without eIF4E binding to the cap,
monocistronic 5' UTR constructs were co-transfected with a plasmid expressing a
dominant negative form of the 4E-binding protein, 4E-BP1. 4E binding proteins can
inhibit cap-dependent translation by sequestering eIF4E, thereby preventing eIF4E
from binding the cap structure, and the eIF4E/4E binding protein complex can be
dissociated due to phosphorylation of 4E binding protein. Mutation of the
phosphorylation sites results in a dominant negative form of the 4E-BP1.

Cotransfected B104 cells were grown for 24 hr and assayed for Photinus
luciferase activity. Luciferase activity from cells cotransfected with the plasmid
encoding the mutant 4E-BP1 was expressed as a percentage of the luciferase
activity in cells cotransfected with a control expression plasmid. Expression of the
4E-BP1 mutant reduced the luciferase activity from the PM and the β-globin
monocistronics to 45% and 43%, respectively, of the luciferase activity from cells
containing the same constructs transfected with the control expression plasmid. All of
the constructs containing the 5' UTRs from the dendritically localized mRNAs (SEQ
ID NOS: 23 to 26) had luciferase activities that were above those of PM and
β-globin. This result indicates that translation of these mRNAs occurs by both
cap-dependent and cap-independent mechanisms.

The luciferase activity produced by the ARC (SEQ ID NO: 25) and MAP2
(SEQ ID NO: 26) 5' UTRs was reduced by 38% and 32% of control, respectively, yet
remained approximately 45 to 50% higher than PM and β-globin, indicating both
cap-dependent and cap-independent contributions to the translation of monocistronic
mRNA. However, the CamK IIa 5' UTR (SEQ ID NO: 23), which strongly enhanced
the translation of a monocistronic mRNA, was only reduced by about 20% of control
when cotransfected with the 4E-BP1 mutant. Interestingly, the dendrin 5' UTR (SEQ
ID NO: 24), which had the least IRES activity in dicistronic mRNAs, demonstrated
luciferase activity that was only 13% less than control in the presence of 4E-BP1
mutant, indicating that IRES-dependent translation may be a major contributor to its
overall levels of translation. These results are consistent with the results obtained
using the hairpin constructs, and demonstrate that translation of monocistronic
mRNAs containing the 5' UTRs from the four dendritically localized mRNAs (SEQ ID
NOS: 23 to 26) occurs by both cap-dependent and cap-independent mechanisms.
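
The percentages reported above can be placed on a common scale by converting each
reported reduction into residual activity (percent of control). The sketch below uses
only the values stated in the text; the variable and function names are illustrative
assumptions.

```python
# Percent reduction relative to control in the presence of the 4E-BP1 mutant,
# as reported in the text for the four dendritic 5' UTRs.
reduction_pct = {"ARC": 38, "MAP2": 32, "CamK IIa": 20, "dendrin": 13}

# Residual activity (percent of control) reported for the cap-dependent controls.
control_residual_pct = {"PM": 45, "beta-globin": 43}


def residual_activity(reduction):
    """Convert a percent reduction into percent-of-control residual activity."""
    return 100 - reduction


residual_pct = {name: residual_activity(r) for name, r in reduction_pct.items()}

# Rank the 5' UTRs from least to most sensitive to loss of eIF4E availability.
ranked = sorted(residual_pct, key=residual_pct.get, reverse=True)

for name in ranked:
    print(f"{name}: {residual_pct[name]}% of control")
```

On this scale every dendritic 5' UTR retains more activity than either control, with
dendrin the least affected, which is the comparison on which the conclusion above
rests.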

I. Characterization of RC3 Translational Regulatory Element

The RC3 5' UTR (SEQ ID NO: 27) was also analyzed in the dual luciferase
dicistronic mRNA. Northern blot analysis of the RC3 5' UTR in the RP construct
resulted in the appearance of two mRNAs. Removal of the chimeric intron from the
RP vector resulted in the production of one mRNA of the correct size predicted for
the dicistronic mRNA. The leader sequence of β-globin mRNA as well as the EMCV
IRES were cloned into the intercistronic region of this modified intron-less construct
(RPi) and were analyzed in transiently transfected B104 cells. The resulting β-globin
and EMCV P:R ratios were similar to those generated by the dicistronic mRNAs
containing the chimeric intron. The P:R ratio produced from dicistronic mRNAs
containing the RC3 5' UTR (SEQ ID NO: 27) was approximately 18-fold higher than
RPi and approximately 3-fold higher than that obtained with the EMCV IRES. The
P:R ratios obtained with constructs containing a hairpin structure 5' of the first cistron
were not significantly different from those obtained with the RP construct. These
results indicate that the RC3 leader sequence (SEQ ID NO: 27) also contains an IRES.

The activity of the RC3 IRES was examined in primary hippocampal neurons. To
analyze protein synthesis in dendrites, a monocistronic fluorophore reporter construct
was engineered to target the mRNAs to dendritic processes. Targeting sequences
have been identified within the 3' UTRs of several dendritically localized mRNAs
(Kiebler and DesGroseillers, Neuron 25:19-28, 2000, which is incorporated herein by
reference). To determine if the RC3 3' UTR functioned similarly, it was included in
the 3' UTR of a monocistronic mRNA encoding EYFP along with a nuclear
localization sequence (EYFP/NLS). The inclusion of the RC3 3' UTR resulted in
expression of a significant amount of the fluorescent protein in dendrites of
transfected neurons; constructs lacking this 3' UTR expressed the fluorescent protein
only in the nucleus. This result suggests that the RC3 3' UTR could target mRNA to
the dendrites of transfected neurons.

In situ hybridization confirmed that reporter mRNAs containing the RC3
3' UTR were localized throughout the processes of transfected hippocampal neurons,
whereas reporter mRNAs lacking this 3' UTR were confined to the cell body. In
addition, co-localization studies using the MAP2 antibody, which specifically stains
dendrites, showed that mRNAs containing the RC3 3' UTR occurred within dendrites.
These results validated the use of the RC3 3' UTR in directing mRNAs to the
dendritic compartment and also demonstrated that translation occurs in dendrites.

To examine whether IRES-mediated translation can occur within dendrites, a
dicistronic mRNA containing ECFP as the first cistron, EYFP as the second cistron,
and the RC3 3' UTR was used. To limit diffusion from the site of synthesis and
facilitate quantification of de novo local protein synthesis, both fluorescent proteins
were engineered to contain the mouse ODC destabilizing motif, which shortened the
half-lives of these proteins to approximately four hours. As an initial analysis of the
dicistronic fluorophore mRNAs, the constructs were tested in transfected N2a cells;
IRES activities using this construct were similar to those observed using the dual
luciferase vector.

Hippocampal neurons were transfected with dicistronic ECFP/EYFP
constructs containing the 5' leader sequences from β-globin and RC3 (SEQ ID
NO: 27) mRNAs within the intercistronic region. Fluorescence measurements were
taken from the cell body as well as from the dendrites. Fluorescent overlay images
using an ECFP and EYFP filter were taken from the dendritic fields of two neurons
transfected with dicistronic fluorophore constructs containing either the β-globin or
RC3 leader sequence in the intercistronic region, respectively. Transfection of
hippocampal neurons with the dicistronic construct containing the β-globin 5' UTR
resulted in mainly cap-dependent ECFP expression (visualized as cyan). However,
the mRNA containing the RC3 leader sequence (SEQ ID NO: 27) demonstrated both
cap-dependent translation and IRES-dependent translation (visualized as yellow),
generating turquoise. Fluorescence activity of both cistrons was quantified from
images and calculated as a yellow:cyan ratio.

The results demonstrated that the RC3 leader functioned as an IRES in both
the cell body and the dendrites. Distribution of the activity of the RC3 IRES was
punctate in the dendrites, indicating that activity was increased near synapses. In
addition, the RC3 IRES (SEQ ID NO: 27) was approximately 2-fold more active in
the dendrites than in the cell body. A northern blot of these
dicistronic fluorophore mRNAs contained a single hybridizing band of the correct
molecular weight.
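
One way the yellow:cyan ratio could be compared between compartments is sketched
below. The pixel intensities, the background value, and the function name are
hypothetical; the actual image quantification procedure is not specified in the text.

```python
from statistics import mean


def yellow_cyan_ratio(eyfp_pixels, ecfp_pixels, background=0.0):
    """Background-corrected EYFP:ECFP ratio for one region of interest."""
    yellow = mean(eyfp_pixels) - background
    cyan = mean(ecfp_pixels) - background
    if cyan <= 0:
        raise ValueError("ECFP signal must exceed background")
    return yellow / cyan


# Hypothetical pixel intensities sampled from two regions of one neuron.
soma_ratio = yellow_cyan_ratio([60, 60, 60], [110, 110, 110], background=10)
dendrite_ratio = yellow_cyan_ratio([50, 50], [50, 50], background=10)

# Relative IRES activity in dendrites compared with the cell body.
relative_activity = dendrite_ratio / soma_ratio
print(relative_activity)
```

Because both fluorescent proteins are translated from the same mRNA, dividing by the
ECFP signal removes differences in local mRNA abundance between the cell body and
the dendrites.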

Since many synaptic modifications do not persist when translation is inhibited,
translation has been suggested to have an essential role in mediating changes in the
morphology and efficacy of individual synapses in response to synaptic activity.
Moreover, it is thought that some synaptic changes result from translation at
post-synaptic sites rather than from cell-wide changes in protein synthesis. Such local
translation would enable different synapses to be modified independently within the
same cell. Evidence to support the notion of local protein synthesis includes the
observations that components of the translation machinery are present in dendrites, that
protein synthesis can occur at these cellular processes, and that many dendritically
localized mRNAs encode proteins that are integral to synaptic modifications,
including structural proteins (ARC, MAP2), growth factors, ionotropic receptors, and
kinases (CamK IIa; see Kuhl and Skehel, Curr. Opin. Neurobiol. 8:600-606, 1998).
These observations suggest that translation in dendrites is used to rapidly and locally
synthesize some of the proteins required to strengthen active synapses.

As disclosed herein, the leader sequences of five dendritically localized
mRNAs (SEQ ID NOS: 23 to 27) contain IRES activity, which mediates
cap-independent translation. In addition, translation mediated by the RC3 5' UTR (SEQ
ID NO: 27) occurred by both cap-dependent and cap-independent mechanisms within
dendrites of cultured hippocampal neurons. Furthermore, translation by the
cap-independent mechanism was relatively more efficient in the dendrite than in the cell
body.

These 5' UTR sequences having IRES activity can serve to ensure or regulate
translation of specific messages under varying conditions and can be involved in
effecting synaptic plasticity. The identification of an IRES in the 5' UTR of the
mRNA encoding the alpha subunit of CaM Kinase II is consistent with results
showing increased translation of this mRNA following treatment of developing
synapses with NMDA receptor agonists, even though total protein synthesis was
impaired (Scheetz et al., Nat. Neurosci. 3:211-216, 2000). It has also been reported
that the 5' UTR of the mRNA encoding Pim kinase, a protein that has recently been
implicated in the stabilization of LTP, also contains an IRES (Johannes et al., Proc.
Natl. Acad. Sci. USA 96:13118-13123, 1999).

Inasmuch as ribosomes and initiation factors are present at low concentrations
in dendrites, IRESes can be used by dendritically localized mRNAs to increase their
ability to recruit the translation machinery within this cellular compartment. Indeed,
IRESes appear to be used by some mRNAs when cap-dependent translation is
reduced or blocked. For instance, overall levels of translation are reduced during
apoptosis and during various types of cellular stress such as ischemia (Sheikh and
Fornace, Oncogene 18:6121-612, 1999) and, in these situations, IRES-mediated
translation continues (Holcik et al., Nat. Cell Biol. 1:190-192, 1999; Stein et al., Mol.
Cell. Biol. 18:3112-3119, 1998; Stoneley et al., Mol. Cell. Biol. 20:1162-1169, 2000).
In addition, it has been reported that synaptic activity leads to an increase in protein
synthesis in or near the active synapses, an increase that could saturate the
cap-dependent translation machinery in dendrites. Thus, internal initiation can provide a
means to initiate translation of dendritically localized messages in response to
synaptic stimuli.

EXAMPLE 5

IDENTIFICATION OF TRANSLATIONAL REGULATORY ELEMENTS
THAT ARE ACTIVE IN GROWING YEAST CELLS
This example demonstrates that translational regulatory elements are present
in mRNAs encoded in yeast cells, and that isolated elements of the yeast mRNA
sequences have translational regulatory activity in vegetatively growing yeast cells.

In higher eukaryotes, translation of some mRNAs occurs by internal initiation.
It is not known, however, whether this mechanism is used to initiate the translation of
any yeast mRNAs. As disclosed herein, naturally occurring nucleotide sequences that
function as IRES elements within the 5' leader sequences of Saccharomyces
cerevisiae YAP1 and p150 mRNAs were identified. When tested in the 5' UTRs of
monocistronic reporter genes, both leader sequences enhanced translation efficiency
in vegetatively growing yeast cells. Moreover, when tested in the intercistronic
region of dicistronic mRNAs, both sequences exhibited IRES activity that functioned
in living yeast cells. The activity of the p150 leader was much greater than that of the
YAP1 leader. Further analyses of the p150 IRES revealed several non-overlapping
segments that independently mediated internal initiation. These results demonstrate
that the p150 IRES has a modular structure similar to IRES elements contained within
some cellular mRNAs of higher eukaryotes. Both YAP1 and p150 leaders contained
several complementary sequence matches to yeast 18S rRNA.

The plasmid pMyr (Stratagene) was used as backbone for both dicistronic and
monocistronic constructs. An adaptor containing restriction sites Hind III, Pst I,
Nhe I, Eco RI, Nco I, and Xba I was introduced into the pMyr vector immediately
downstream of the GAL1 promoter, using Hind III and Xba I as cloning sites. The
Pst I and Xba I sites were used as cloning sites for a fragment from the RPh dicistronic
reporter vector (see Example 1A). The resulting construct, pMyr-RP, encodes a
dicistronic mRNA that encodes Renilla (sea pansy) and Photinus (firefly) luciferase
proteins as the first (upstream) and second (downstream) cistrons, respectively. These
cloning steps resulted in a 5' UTR that differs slightly from that in the RP mRNA
described previously (Example 1A). The CYC1 terminator sequence contained
within the pMyr-1 vector provides signals for termination of transcription and
polyadenylation.

The p150, YAP1, and CLN3 leader sequences were PCR amplified using
yeast genomic DNA as a template. These leader sequences were cloned into the
intercistronic region of the pMyr-RP vector using Eco RI and Nco I restriction sites
that were introduced at the 5' and 3' ends of the leader sequences to generate
constructs designated as pMyr-p150/RP, pMyr-YAP1/RP, and pMyr-CLN3/RP. A
hairpin structure with a predicted stability of -50 kcal/mol (Stoneley et al., supra,
1998) was introduced into the 5' UTR of the dicistronic constructs to generate
pMyr-p150/RPh, pMyr-YAP1/RPh, and pMyr-CLN3/RPh. Deletions and fragments
of the p150 leader were generated by PCR amplification of the p150 sequence, again
using Eco RI and Nco I as cloning sites.

Monocistronic constructs containing the Photinus luciferase gene were
generated in the modified pMyr vector. The Photinus luciferase gene was obtained
from the pGL3 control vector (Promega) as an Nco I/Xba I fragment and cloned using
these same sites to generate construct pMyr/P. The leader sequences from YAP1,
p150, and CLN3 mRNAs, as well as the hairpin structure, were cloned into the pMyr/P
vector using the same restriction sites used for the dicistronic constructs. Constructs
containing the CAT gene were cloned into the pGAD10 vector (Clontech). The
pGAD10 vector was digested with Hind III and an adaptor containing restriction sites
Hind III, Pst I, Nhe I, Eco RI, Nco I, and Xba I was introduced into this site, which is
immediately downstream of the ADH promoter. The CAT gene was obtained from
the pCAT3 control vector (Promega) and cloned into the modified pGAD10 vector
using Nco I and Xba I restriction sites. The p150 leader sequence was introduced into
this vector as an Eco RI/Nco I fragment to generate the construct designated
p150/CAT. The hairpin structure described above was introduced 5' of this leader
sequence to generate the construct designated p150/CATh.

The yeast strain EGY48 (MATα, his3, trp1, ura3, LexAop(x6)-LEU2; Clontech)
was used throughout the study. Yeast strains harboring the pMyr based plasmids
were grown overnight in 4 ml synthetic defined medium (SD) with uracil and glucose.
The following morning, cells were harvested, washed with 4 ml H2O, and grown for
3 hr in 4 ml SD medium without uracil with the addition of 2% galactose and 1%
raffinose. Cells harboring the pGAD10 constructs did not require induction and were
cultured in 4 ml SD/Ura glucose medium overnight. Cells were lysed with 1x lysis
buffer (diluted freshly from 5x stock; Promega) in tubes with glass beads. Tubes
were vortexed twice for 30 sec and then spun in a microfuge at top speed for
3 min at 4°C. The supernatant was recovered and 20 µl of the lysate was used to
assay luciferase activities using the dual reporter assay system (Promega). CAT
activity was measured using n-butyryl CoA according to technical bulletin no. 84
(Promega).

RNA was isolated from 4 ml cell culture samples. Cells were pelleted,
washed with water, and resuspended in 400 µl of TES buffer (100 mM Tris-HCl,
pH 7.5, 10 mM EDTA, 0.5% SDS). RNA was extracted using preheated phenol
(65°C); the mixture was vortexed for 1 min and incubated at 65°C for one hr.
Samples were put on ice for 5 min, then centrifuged at 15,000 rpm for 5 min and the
top aqueous phase was collected, re-extracted with phenol once and chloroform once.
RNA was precipitated with isopropanol, the precipitate was washed with
70% ethanol, dried and dissolved in water. RNA samples were separated by gel
electrophoresis using 1% formaldehyde/agarose gels and transferred to Nytran
Supercharge nylon membrane (Schleicher & Schuell). The blots were probed with
full-length firefly luciferase RNA antisense probe that was labeled with 32P.

A. Analysis of YAP1 5' UTR

The 164 nucleotide YAP1 leader sequence (SEQ ID NO: 28) was examined
for translational regulatory activity in the 5' UTR of a firefly (Photinus) luciferase
reporter mRNA (YAP1/P). Cells were transformed with constructs expressing the
parent Photinus (-/P) mRNA, the YAP1/P mRNA, or the 364 nucleotide 5' leader of
the CLN3/P mRNA as a spacer control. Transcription of these monocistronic
mRNAs was under control of the GAL1 promoter; mRNA expression was induced
with galactose, cells were lysed after 3 hr, and luciferase activities were determined and
normalized to Photinus luciferase mRNA levels. Translation efficiency of the
YAP1/P mRNA was approximately 10-fold greater than that of either the control -/P
or CLN3/P mRNAs. This result indicates that the YAP1 5' UTR has translational
enhancing activity.
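
The normalization described above, luciferase activity divided by reporter mRNA
level, can be sketched as follows. The numerical values and names are hypothetical
placeholders chosen only to illustrate the arithmetic; they are not measurements from
this example.

```python
def translation_efficiency(luciferase_activity, mrna_level):
    """Luciferase activity per unit of reporter mRNA (arbitrary units)."""
    if mrna_level <= 0:
        raise ValueError("mRNA level must be positive")
    return luciferase_activity / mrna_level


# Hypothetical (luciferase activity, relative Photinus mRNA level) pairs.
samples = {
    "-/P": (1000.0, 1.0),      # parent reporter
    "YAP1/P": (5000.0, 0.5),   # reporter carrying the YAP1 leader
    "CLN3/P": (2000.0, 2.0),   # spacer control
}

efficiency = {name: translation_efficiency(*v) for name, v in samples.items()}

# Express each construct relative to the parent reporter.
fold_vs_parent = {name: e / efficiency["-/P"] for name, e in efficiency.items()}
print(fold_vs_parent)
```

Normalizing to mRNA level separates a change in translation from a change in
transcript abundance: in this made-up data set the YAP1/P sample has less mRNA than
the parent yet a higher activity per transcript.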

To determine whether the translation mediated by the YAP1 transcribed leader
sequence has a cap-independent component, it was tested in a dicistronic mRNA for
its ability to mediate internal initiation. The leader sequence of YAP1 mRNA was
placed in the intercistronic region of a dual luciferase dicistronic mRNA and
examined for IRES activity. In these mRNA transcripts, the upstream cistron encodes
Renilla (sea pansy) luciferase and the downstream cistron encodes Photinus
luciferase. Cells were transformed with constructs encoding the parent RP mRNA, or
with constructs containing the YAP1 or CLN3 leaders in the intercistronic region of
the RP mRNA. The YAP1 leader sequence enhanced the translation of the
downstream Photinus luciferase cistron approximately 5-fold relative to that of the RP
mRNA. In contrast, the CLN3 leader had almost no effect on the expression of the
second cistron relative to that of the RP mRNA.

Hairpin structures were inserted in the dicistronic constructs upstream of the
Renilla luciferase gene to block scanning and, thereby, reduce the translation of this
reporter molecule. The hairpin structures blocked Renilla luciferase expression by
greater than 90%. Nevertheless, the YAP1 leader permitted translation of the
Photinus luciferase gene, even when translation of the Renilla luciferase gene was
blocked. This result demonstrates that the YAP1 leader did not increase expression of
the second cistron by reinitiation or leaky scanning.

To exclude the possibility that enhanced expression of the downstream cistron
was from shorter, monocistronic mRNAs generated by mechanisms such as RNA
fragmentation or an unusual splicing event, RNA was isolated from transformed cells
and analyzed by northern blot analysis using a probe to the downstream Photinus
luciferase gene. The results demonstrated that the dicistronic mRNAs were intact.
Thus, translation of the second cistron was not due to initiation via shorter transcripts.
Together, these results demonstrate that the YAP1 5' UTR comprises a nucleotide
sequence that has IRES activity and that has translational enhancing activity.

B. Analysis of p150 5' UTR

The yeast p150 5' UTR also was examined for translational regulatory activity.
The 5' leader of the mRNA encoding the p150 protein was determined by primer
extension analysis to contain 508 nucleotides (SEQ ID NO: 29; see, also, Goyer et al.,
Mol. Cell. Biol. 13:4860-4874, 1993, which is incorporated herein by reference).
This sequence contains 11 open reading frames (ORFs) and does not appear to
contain or be part of an intron (Costanzo et al., Nucl. Acids Res. 28:73-76, 2000,
which is incorporated herein by reference), consistent with the observation that only
4% of yeast genes contain introns, 90% of which encode ribosomal proteins. The
presence of the upstream ORFs in the p150 leader might be expected to inhibit
translation by a scanning mechanism.

The p150 sequence was tested in the 5' UTR of a monocistronic reporter
mRNA. Constructs containing this sequence enhanced the translation efficiency of
the reporter gene up to 10-fold. However, the analysis was complicated by the
appearance of a second band of approximately 1 kb, which may be a partial degradation
product of the luciferase mRNA; this RNA was too short to encode a functional
Photinus luciferase protein. Accordingly, the p150 leader was tested in the 5' UTR of
the CAT reporter gene to further evaluate whether it was functioning as a translational
enhancer. The results obtained using the CAT reporter construct were similar to those
obtained with the Photinus luciferase reporter gene; the p150 leader sequence
enhanced the translation efficiency of the CAT reporter gene 9-fold.

To determine whether any translation mediated by the p150 5' leader was
cap-independent, a hairpin structure was inserted at the 5' end of this construct. Although
the hairpin structure inhibited translation of a control CAT mRNA by greater than
90%, translation mediated by the p150 leader sequence was not inhibited but, instead,
was enhanced by approximately 3-fold. The CAT mRNA levels did not appear to be
affected. These results demonstrate that the translation mediated by this leader
sequence is cap-independent.

To confirm that translation was cap-independent, the p150 leader was tested in
the intercistronic region of the dual luciferase RP dicistronic mRNA. In this location,
the p150 leader functioned as a potent IRES, enhancing translation of the downstream
Photinus luciferase cistron approximately 200-fold relative to that of the RP parent
vector. This increase in Photinus luciferase activity in the p150/RP mRNA resulted in
Photinus luciferase protein levels that were approximately twice those of Renilla
protein levels.

Blocking the translation of the upstream Renilla luciferase gene with a hairpin
structure resulted in an even greater enhancement of the Photinus:Renilla luciferase
ratio, indicating that the translation facilitated by this sequence was not dependent on
the translation of the upstream Renilla luciferase cistron. As with the findings with
YAP1, the enhanced expression of the downstream cistron was not associated with
RNA fragmentation or unusual splicing events.

The p150 leader sequence was sequentially deleted from the 5' end and
fragmented into shorter segments, including fragments consisting of nucleotides 100
to 508, 160 to 508, 250 to 508, 375 to 508, 429 to 508, 481 to 508, 250 to 390, and
1 to 250 of SEQ ID NO: 29, each of which was tested for IRES activity. Most of the
IRES activity was associated with nucleotides 160 to 508 of SEQ ID NO: 29.
However, all of the fragments examined demonstrated some level of IRES activity.
Furthermore, deletion of nucleotides 1 to 100 or nucleotides 100 to 160 of SEQ ID
NO: 29 increased translation by internal initiation, indicating that this 160 nucleotide
region contains translational inhibitory sequences, which can inhibit IRES activity.
The leader sequence in construct p150(250-508) corresponds to that of a shorter
leader sequence that occurs naturally (Goyer et al., supra, 1993). This shorter leader
sequence has a level of IRES activity that is similar to that of the entire 508 nucleotide
leader (SEQ ID NO: 29).
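
The fragment coordinates listed above are 1-based and inclusive. A minimal sketch of
how such fragments could be extracted from a leader sequence is shown below. The
placeholder string of N residues stands in for SEQ ID NO: 29, which is not reproduced
here, and the function name is an illustrative assumption.

```python
P150_LEADER_LENGTH = 508

# Fragment boundaries (start, end) as listed in the text, 1-based inclusive.
FRAGMENTS = [
    (100, 508), (160, 508), (250, 508), (375, 508),
    (429, 508), (481, 508), (250, 390), (1, 250),
]


def fragment(leader, start, end):
    """Return nucleotides start..end (1-based, inclusive) of the leader."""
    if not 1 <= start <= end <= len(leader):
        raise ValueError(f"coordinates {start}-{end} outside leader")
    return leader[start - 1:end]


# Placeholder sequence only; the real leader is SEQ ID NO: 29.
placeholder_leader = "N" * P150_LEADER_LENGTH

lengths = {
    f"p150({s}-{e})": len(fragment(placeholder_leader, s, e))
    for s, e in FRAGMENTS
}
for name, length in lengths.items():
    print(name, length)
```

Stating the coordinate convention explicitly matters because an off-by-one shift
would change which residues fall inside short segments such as nucleotides 481 to 508.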

It was previously noted that many eukaryotic mRNAs contain short
complementary sequence matches to 18S rRNA, raising the possibility that ribosome
recruitment at some cellular IRESes might occur by base pairing between mRNA and
18S rRNA (Chappell et al., supra, 2000; Mauro and Edelman, supra, 1997; Hu et al.,
supra, 1999; Tranque et al., supra, 1998). Comparison of the YAP1 and p150 leader
sequences to yeast 18S rRNA identified two (SEQ ID NOS: 30 and 32) and four (SEQ
ID NOS: 34, 36, 38 and 40) complementary sequence matches, respectively, which
contained stretches of up to 10 nucleotides of perfect complementarity (Figure 3). In
addition, two of the matches are part of more extensive complementary matches of up
to 25 nucleotides with 84% complementarity. The complementary match at
nucleotides 130 to 142 of the p150 IRES (SEQ ID NO: 34) is correlated with a
60 nucleotide segment of the p150 5' UTR that can inhibit IRES activity. Another
complementary match of the p150 IRES at nucleotides 165 to 183 (SEQ ID NO: 36)
is correlated with a 90 nucleotide segment of the IRES that contributes to internal
initiation. Two other complementary matches of the p150 IRES at nucleotides 423 to
437 (SEQ ID NO: 38) and nucleotides 437 to 461 (SEQ ID NO: 40) are partially or
fully contained within a 52 nucleotide segment with IRES activity.
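
The kind of comparison described above, scanning a leader sequence for stretches
that can base pair with 18S rRNA, can be illustrated with a minimal Python sketch.
It is not the procedure used to identify SEQ ID NOS: 30 to 40. It assumes that only
perfect Watson-Crick pairing counts (G-U pairs are ignored), and the function names
and the two short toy sequences are hypothetical.

```python
# Watson-Crick pairs only; G-U wobble pairs are deliberately ignored in this sketch.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA string (A, C, G, U only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def longest_complementary_stretch(mrna, rrna):
    """Find the longest stretch of mrna that pairs perfectly with rrna.

    Pairing is antiparallel, so the search is a longest-common-substring
    comparison between mrna and the reverse complement of rrna.
    Returns (length, mrna_start, rrna_start) with 0-based start positions.
    """
    target = reverse_complement(rrna)
    best_len, best_i, best_j = 0, 0, 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(mrna) + 1):
        cur = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if mrna[i - 1] == target[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_i, best_j = cur[j], i - cur[j], j - cur[j]
        prev = cur
    # Convert the start in the reverse complement back to rRNA coordinates.
    rrna_start = len(rrna) - best_j - best_len
    return best_len, best_i, rrna_start


# Hypothetical toy sequences, not taken from any SEQ ID NO in this document.
toy_mrna = "AAAAGGAUCCGUAAAA"
toy_rrna = "CCCCACGGAUCCCCCC"
print(longest_complementary_stretch(toy_mrna, toy_rrna))  # (8, 4, 4)
```

A real comparison would also need to tolerate mismatches and G-U pairs in order to
report extended matches such as the 25 nucleotide matches with 84% complementarity
noted above.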

It was previously suggested that the yeast translation machinery may be
capable of mediating internal initiation (Iizuka et al., Mol. Cell. Biol. 14:7322-7330,
1994; Paz et al., J. Biol. Chem. 274:21741-21745, 1999, each of which is incorporated
herein by reference). The present results demonstrate unequivocally that yeast IRES
sequences contained within the YAP1 and p150 leader sequences are functional in
vegetatively growing cells. In addition, numerous sequences sharing
complementarity with yeast 18S rRNA were identified within both leader sequences
(Figure 3). Many other mRNAs and cellular IRESes contain similar features, and the
complementary sequence matches to 18S rRNA can function as cis-acting sequences
that affect translation (see, for example, Chappell et al., supra, 2000). In the case of
the Gtx IRES module (SEQ ID NO: 2), this segment is 100% complementary to a
sequence of 18S rRNA. Recruitment of ribosomes at this site appeared to involve
base pairing to 18S rRNA within 40S ribosomal subunits. These results indicate that
recruitment of ribosomes at some cellular IRES elements, including the yeast YAP1
and p150 IRESes, can occur directly due to base pairing to rRNA, a mechanism
consistent with the modular nature of these cellular IRES elements.

The leader sequence of the YAP1 mRNA contained an IRES element that
contributed to the efficient translation of this mRNA. Sequence features of this leader
previously have been shown to affect translation and mRNA stability (Vilela et al.,
Nucl. Acids Res. 26:1150-1159, 1998; Ruiz-Echevarria and Peltz, Cell 101:741-751,
2000, each of which is incorporated herein by reference). One of these features, a
short upstream open reading frame (uORF), did not inhibit translation of the main
ORF, even though it was recognized by a large fraction of the scanning ribosomes.
Inasmuch as uORFs generally inhibit the translation of downstream cistrons, these
results indicated that reinitiation and leaky scanning were also involved in the
efficient translation of the YAP1 mRNA.

The p150 IRES element was particularly active. Although most of the IRES
activity was localized to nucleotides 160 to 508 (SEQ ID NO: 29), the IRES
boundaries were not distinct. Moreover, several non-overlapping segments
functioned independently, suggesting that this IRES has a modular composition.
Using methods as described in Example 1, which identified the IRES module (SEQ
ID NO: 2) present in the Gtx homeodomain protein, IRES modules in the p150
5' UTR similarly can be isolated.

The notion that short nucleotide sequences can recruit the translation
machinery is not consistent with the proposal that higher order RNA conformations
are uniformly important for the activity of some cellular IRESes. Indeed, the results
obtained from deletion and fragment analyses of IRESes contained within other
mammalian and insect cellular mRNAs indicate that many of these IRESes may also
be modular (see, for example, Yang and Sarnow, Nucl. Acids Res. 25:2800-2807,
1997; Sella et al., Mol. Cell. Biol. 19:5429-5440, 1999). The modular composition of
cellular IRESes contrasts with that of viral IRESes. For example, in picornaviruses, the
IRESes comprise several hundred nucleotides and contain RNA conformations that
appear to be highly conserved and that are important for activity.

It is not known how widely internal initiation is used by yeast or higher
eukaryotic mRNAs. The identification of numerous insect and mammalian IRESes
may reflect a more extensive use of this mechanism in higher eukaryotes, or it may
reflect incidental bias that has resulted in the evaluation of many more mRNAs from
insects and mammals than from yeast. Some mammalian IRESes do not function in
living yeast. In the case of poliovirus, the inactivity of its IRES in S. cerevisiae
reflects a specific blockage that occurs via a short inhibitory RNA. The inactivity of
some mammalian IRESes in yeast may also reflect trans-acting factor requirements
that are not provided by yeast cells or differences related to the ability of a sequence
to bind a component of the translation machinery that is not identical to that in yeast.
For example, p150 is the yeast homologue of mammalian translation initiation factor
eIF4G, but the two are not functionally interchangeable.

In higher eukaryotes, IRESes are used by some mRNAs during the G2/M
phase of the cell cycle and under conditions that reduce cap-dependent translation, as
seen, for example, during different types of stress. In yeast, internal initiation may
also be used to facilitate the translation of essential genes under similar conditions,
including during periods of nutritional deficiency. It may be significant that IRESes
were identified within the YAP1 and p150 leader sequences given that overexpression
of YAP1 confers general resistance to many compounds, and that expression of p150
when cap-dependent translation is reduced can allow translation of other mRNAs
under adverse conditions.

The identification of yeast IRESes that function in vegetatively growing cells
suggests that yeast and higher eukaryotes use similar mechanisms to initiate
translation. The analysis of these mechanisms should be facilitated in yeast, since
many strains of yeast having mutations in genes involved in translation are available.
The ability to easily manipulate this organism genetically can facilitate the
identification of specific factors involved in internal initiation and confirmation that
base pairing between certain IRES sequences and 18S rRNA is important for
recruitment of ribosomes at these sites. In addition to these scientific interests, the
identification of yeast IRESes that function as translational enhancers in
monocistronic mRNAs also provides numerous applications for bioengineering.

EXAMPLE 4

IDENTIFICATION AND CHARACTERIZATION OF
SYNTHETIC IRES ELEMENTS

This example demonstrates that translational regulatory elements, including
IRES elements, can be identified by screening libraries of random oligonucleotides.

A. Retroviral Vector

Retroviruses are extremely useful tools to deliver genes into eukaryotic cells
both in culture and in whole animals. Currently, however, most retroviral vectors are
not tailored for tissue specific or developmental stage specific delivery of genes.
Thus, a benefit of screening a retroviral library for functional synthetic translational
regulatory elements as disclosed herein is the potential to create novel retroviruses
with exquisite target specificity. Such vectors can be extremely useful for generating
cell lines or transgenic animals for diagnostic screening procedures and drug
development. In addition, such vectors can be useful for gene therapy in humans.

A retrovirus is a single stranded RNA virus that infects a cell and integrates
into the genome of a cell by copying itself into a double stranded DNA molecule by
reverse transcription. The integrated retrovirus genome is referred to as a provirus.
Retroviruses have a two stage life cycle, existing in both an RNA and a DNA form.
The RNA form of the virus is packaged into an infectious particle that is coated with a
glycoprotein that is recognized by receptors on the host cell. This interaction
promotes a receptor mediated internalization event, resulting in exceptionally efficient
delivery of the viral genome into the cell. After transport to the cell nucleus and
uncoating, the RNA genome is reverse transcribed into a DNA form (a provirus).
During the reverse transcription process, the provirus integrates into the host cell
genome. Retroviruses do not integrate in a completely random fashion, but instead
have a distinct preference for integration into regions of the genome that are
transcriptionally competent. This characteristic reduces the likelihood that the
provirus will be silenced by integration into a transcriptionally repressive domain.

In a recombinant retrovirus, the entire coding region of the virus is removed
and replaced with a transgene. This replacement is done by standard molecular
biological techniques using a proviral version of the virus that is propagated as a
bacterial plasmid (a pro-retroviral vector). However, other sequences in the retrovirus
genome are required for the functions of viral transcription and packaging, including
genes encoding the viral gag and pol proteins, and the viral glycoprotein coat. While
such sequences can be removed from the pro-retroviral plasmid, they must be
provided in trans, for example, on other plasmids that are introduced into the host cell
via cellular transfection in order to obtain a fully functional recombinant virus.
Alternatively, these helper functions can be designed to be integrated into the cellular
genome of the viral packaging cell line.

Retroviruses have two viral promoters called long terminal repeats (LTRs),
one located at each end of the viral genome. The upstream LTR is responsible for
promoting transcription of the DNA provirus into the RNA form. The downstream
LTR is not used for transcription during the RNA phase of the life cycle. However,
during reverse transcription of the RNA into the DNA provirus, the downstream LTR
provides a template for the replication of the upstream LTR. Thus, native retroviruses
contain identical sequences in their upstream and downstream LTRs.

The recombinant retroviral vector designed for the IRES selection procedure
was designated MESVR/EGFP/ECFP/RSVPro (SEQ ID NO: 16; see, also, Figure 4).
This vector was based on MESV/IRESneo (Owens et al., Cancer Res. 58:2020-2028,
1998; Mooslehner et al., J. Virol. 64:3056-3058, 1990; Rohdewohld et al.,
J. Virol. 61:336-343, 1987, each of which is incorporated herein by reference). MESV
is a C-type retrovirus that was modified to remove sequences that are necessary for
independent replication. As such, the virus can only replicate with the assistance of
helper genes that encode the proteins required for viral genome packaging and
insertion into the host genome.

Features of the MESVR/EGFP/ECFP/RSVPro vector include that 1) a
multiple cloning site was introduced into the downstream LTR for insertion of the
exogenous sequences that can regulate transcriptional activity of a transgene encoded
by the recombinant retrovirus, and the endogenous viral core promoter was replaced
with a strong basal promoter to potentiate transcription promoting activity of inserted
sequences; 2) a mutated EGFP encoding sequence, followed by a multiple cloning site
to allow insertion of sequences to be tested and a sequence encoding ECFP
to allow assay of translational activity on a single cell basis, was introduced;
3) enhancer elements in the upstream LTR were replaced with those from RSV to
drive higher levels of RNA genome production in the packaging cells; and 4) an SV40
origin of replication was inserted in order to increase the copy number of the
retroviral plasmids in the packaging cells. The EGFP and ECFP reporter genes are
expressed as a single transcript, in which the mRNAs are linked by a spacer sequence,
which can contain an oligonucleotide to be examined for IRES activity. Expression
of both reporter genes is controlled by a strong RSV promoter to ensure efficient
transcription of the RNA viral genome and, therefore, a high viral titer. The multiple
cloning site between the EGFP and ECFP coding sequences facilitates the insertion of
an oligonucleotide to be examined for translational activity.

The MESVR/EGFP/ECFP/RSVPro(ori) retroviral vector contains 6096 base
pairs (SEQ ID NO: 16). Following insertion of the oligonucleotides to generate a
library of proviruses, each containing a unique potential translational regulatory
element in the intercistronic region of the dicistronic reporter cassette, the proviral
vector library was transfected into mammalian cells together with helper plasmids
required for viral production, including a plasmid that encodes the group antigen (gag)
and the integrase enzyme (pol) that is packaged with the RNA genome, as well as a
plasmid that encodes the vesicular stomatitis virus coat glycoprotein (VSV-G).
The DNA provirus is transcribed to generate the RNA form, which is packaged into
an infectious viral particle. The viral particle is coated with a glycoprotein that is
recognized by receptors on the host cell, leading to receptor-mediated internalization.
After entry into the cell nucleus, the RNA genome is reverse transcribed into the
DNA form, which is stably integrated into the host cell genome.

The viral packaging protocol involved a triple transfection into COS1 cells of
a library containing pro-retroviral vectors that harbor the putative promoter elements
together with the two separate plasmids that encode the gag/pol and VSV-G proteins,
respectively. Cellular transcription machinery is used to generate the viral RNA
strands that are packaged into viral particles and subsequently bud from the cell
membrane. These viral particles can infect a naive cell as described above. The viral
library is fully representative of the original vector library because all viral RNAs
were transcribed from the same strong promoter. In contrast, each integrated DNA
version of the virus contains a different oligonucleotide sequence (or combination of
sequences) in the intercistronic region, which can effect translation of the second
reporter molecule and selection for which indicates IRES activity of the
oligonucleotide.



Packaging of the proviral vector library was achieved by cotransfection of the
proviral DNA into COS1 cells together with the packaging genes, which are
contained on two separate helper plasmids, pCMV-GP(sal) and pMD.G. The
pCMV-GP(sal) plasmid has a cytomegalovirus promoter (pCMV) driving the genes
that encode the group antigen (gag) and reverse transcriptase enzyme (pol) from the
Moloney murine leukemia virus (MMLV). The pMD.G plasmid encodes the
vesicular stomatitis virus G glycoprotein (Naldini et al., Science 272(5259):263-267,
1996, which is incorporated herein by reference). These two plasmids were
cotransfected into COS1 cells along with the library of recombinant retroviral vectors
containing putative promoter elements in order to generate a library of retroviruses.

B. Selection of Synthetic Oligonucleotides Having IRES Activity
The disclosed synthetic IRES methodology provides a means for selecting
functional IRES elements. The IRES selection method allows the parallel screening
of millions of random oligonucleotide elements or combinations of elements for
activity in mammalian cells. Selection of synthetic IRES elements in mammalian
cells is facilitated if 1) each cell receives a single unique cassette to avoid selection of
inactive elements that are fortuitously present in the same cell as an active element;
2) the synthetic IRES is shielded from the effects of genomic sequences that may
activate or repress translation; 3) the delivery system is efficient so that a complex
library can be readily screened; and 4) the selection process is stringent and is based
on a reporter gene assay that is highly sensitive and faithfully reports the activity of
the IRES elements.

A library of oligonucleotides was ligated immediately upstream of the second
nucleotide sequence of a dicistronic reporter cassette comprising two reporter genes
by insertion into a cloning site in the intercistronic spacer sequence. The exemplified
reporter cassette (see below) contained nucleotide sequences encoding enhanced
green fluorescent protein (EGFP) and enhanced cyan fluorescent protein (ECFP),
which were arranged in a dicistronic construct that allows two separate gene products
to be made from a single mRNA that is driven by a single promoter. After infection
of cells with the retroviral IRES element library and integration into the genome, each
IRES was scored for its translational activity by examining the activity of the ECFP
reporter gene relative to that of EGFP. After 2 to 3 days of infection, infected cells
were selected by FACS to obtain cells expressing both EGFP and ECFP; the level of
ECFP expression in each cell reflected the strength of an individual synthetic IRES
element cassette, such that highly fluorescent cells are likely to contain highly active
IRES elements. After multiple rounds of selection, the IRES sequences were
amplified from the cellular genome by PCR and sequenced using an automated DNA
sequencer to determine the identity of each of the synthetic IRES elements. The
activity of each selected IRES element was confirmed by amplifying the entire IRES
element, inserting the amplified element into a dicistronic luciferase reporter vector,
and screening for the second luciferase reporter protein under translational control of
the inserted IRES. This method allowed the testing of the regulatory cassette in a
different reporter system, which was more amenable to quantitation of IRES activity
levels.

Except as indicated, methods were performed essentially as described in
Examples 1 to 3. A pool of random 18mers, flanked on either side by two different
invariant sequences each 6 base pairs in length, was prepared and inserted into the
Mlu I site in the intercistronic spacer of MESV/EGFP/ECFP/RSVPro (see Figure 4).
A library of recombinant retroviruses was made by transiently transfecting COS1
cells together with plasmids encoding the MLV gag/pol genes and the VSV-G
glycoprotein gene. The library was introduced into B104 cells, then 48 hr later, the
cells were subjected to FACS (see below) and cells expressing high levels of EGFP
and ECFP were collected. The selected cells were replated, then sorted again for
EGFP and ECFP expression. Genomic DNA was extracted from the twice-selected
cells, and the 18mers were isolated by PCR using primers complementary to the
sequences flanking the Mlu I cloning site in the vector.

IRES activity of the PCR amplified sequences was confirmed by cloning the
fragments into the intercistronic region of the dicistronic reporter vector, RPh
(Chappell et al., Proc. Natl. Acad. Sci. USA 97:1536-1541, 2000, which is
incorporated herein by reference). Individual plasmid clones were transfected into
B104 cells and the luciferase activities of the first cistron (Renilla luciferase) and the
second cistron (Photinus luciferase) were assayed. For a given plasmid clone
containing a particular 18mer sequence, an increase in the translation of the second
cistron relative to the first cistron and normalized to the empty vector indicated that
the 18mer functioned as an IRES element.

Oligonucleotides selected using this method and having IRES activity are
shown as SEQ ID NOS: 42 to 46, where the first six nucleotides and last six
nucleotides in each of the sequences are the invariant sequences flanking the
randomly generated 18 nucleotides. These results demonstrate that synthetic IRES
elements can be isolated by screening a library of random oligonucleotides using the
disclosed methods.

C. Selection of Additional IRES Elements

To confirm the validity of the retroviral screening method, and to identify
other short sequences with properties similar to those of the Gtx IRES module (SEQ
ID NO: 2), B104 cells were infected with two retroviral libraries that contained
random sequences of 9 or 18 nucleotides in the intercistronic region. Cells expressing
both cistrons were sorted and sequences recovered from selected cells were examined
for IRES activity using a dual luciferase dicistronic mRNA. Two novel IRES
elements were identified, each of which contained a sequence with complementarity
to 18S rRNA. When multiple copies of either element were linked together, IRES
activities were dramatically enhanced. Moreover, the synthetic IRESes were
differentially active in various cell types. The similarity of these properties to those
of the Gtx IRES module (SEQ ID NO: 2) provides confirmatory evidence that short
nucleotide sequences can function as translational regulatory elements.

Two retroviral libraries were generated. In the first library, an oligonucleotide
containing 18 random nucleotides (N)18 was cloned into the Mlu I site of the
polylinker. The sequence of this oligonucleotide is: acgcgtgatcca(N)18cgagcgacgcgt
(SEQ ID NO: 47; see Edelman et al., Proc. Natl. Acad. Sci. USA 97:3038-3043,
2000, which is incorporated herein by reference). In the second library, an
oligonucleotide containing two segments of 9 random nucleotides (N)9 was cloned
into the Pac I and Mlu I sites of the polylinker. The sequence of this oligonucleotide
was ttaattaagaattcttctgacat(a)9ttctgacat(a)9ttctgacat(a)9(l>09(a)9^^ -
gactcacaaccccagaaacagacatacgcgt (SEQ ID NO: 48), where N and N' are different
random nucleotide sequences. The design of this oligonucleotide was based on the
oligonucleotide (Sm/SiOsP (see Example 1). This oligonucleotide did not have IRES
activity and was used as a spacer control. The first library consisted of about
2.5 x 10^ bacterial clones and the second consisted of about 1.5 x 10^ bacterial clones.
As such, each library represented only a small fraction of the potential sequence
complexity of the random oligonucleotides (about 6.9 x 10^10).
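
The potential sequence complexity quoted above follows from simple counting: 18
fully random positions, whether arranged as one 18 nucleotide segment or as two
9 nucleotide segments, allow 4^18 different sequences. The Python sketch below
reproduces that arithmetic and draws one hypothetical member of the first library.
The flanking sequences are taken as printed for SEQ ID NO: 47; the function names,
the assumption of uniform base usage, and any clone count passed to
fraction_sampled are illustrative only.

```python
import random

LEFT_FLANK = "acgcgtgatcca"   # invariant 5' portion, as printed for SEQ ID NO: 47
RIGHT_FLANK = "cgagcgacgcgt"  # invariant 3' portion, as printed for SEQ ID NO: 47


def sequence_space(n_random, alphabet_size=4):
    """Number of distinct sequences possible at n_random fully random positions."""
    return alphabet_size ** n_random


def random_library_member(rng, n_random=18):
    """Draw one hypothetical library member (uniform base choice is assumed)."""
    core = "".join(rng.choice("acgt") for _ in range(n_random))
    return LEFT_FLANK + core + RIGHT_FLANK


def fraction_sampled(n_clones, n_random=18):
    """Upper bound on the fraction of sequence space covered by n_clones clones."""
    return min(1.0, n_clones / sequence_space(n_random))


print(f"{sequence_space(18):.2e}")                   # 6.87e+10
print(len(random_library_member(random.Random(0))))  # 42
```

Because each clone contributes at most one sequence, fraction_sampled gives an upper
bound; duplicate clones would lower the true coverage further.
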
The retroviral libraries were packaged in COS1 cells. Subconfluent cells were
triply transfected using the FuGENE 6 reagent (Roche Molecular Chemicals;
Indianapolis IN) with plasmids encoding 1) the retroviral library, 2) MoMuLV gag
and pol genes (pCMV-GP(sal)) and 3) the VSV-G glycoprotein (see Tranque et al.,
supra, 1998; Naldini et al., supra, 1996). After 48 hr, retroviral particles were
recovered from culture supernatant, filtered through a 0.45 µm membrane, and then
used to infect B104 rat neural tumor cells (Bottenstein and Sato, Proc. Natl. Acad.
Sci. USA 76:514-517, 1979).

Approximately 2 x 10^ COS1 cells were transfected, and approximately the
same number of B104 cells were subsequently infected. After 72 hr, cells were
harvested and sorted by FACS on a FACSVantage SE (Becton Dickinson; San Jose
CA). EGFP was excited with an argon laser tuned to 488 nm and fluorescence was
recorded through a 530 nm bandpass filter. ECFP was excited with a krypton/argon
laser tuned to 457 nm, and fluorescence was measured through a 495 nm bandpass
filter. As controls for the FACS, B104 cells were infected with the following
reference viruses: the parent vector (MESV/EGFP/ECFP/RSVPro; SEQ ID NO: 16),
a virus encoding EGFP, a virus encoding ECFP, and a virus that contains the IRES
from the encephalomyocarditis virus (EMCV) in the intercistronic region of the
parent vector.

Cells co-expressing both EGFP and ECFP were isolated and returned to
culture for 14 days. These cells were then resorted, and high co-expressors were
isolated and further expanded in culture for 5 to 7 days. Genomic DNA was prepared
using a QiaAmp DNA miniprep kit (Qiagen). Intercistronic sequences were amplified
by PCR using flanking primers, and cloned into the intercistronic region of RPh,
which is a dicistronic vector that encodes Renilla luciferase protein as the first cistron
and Photinus luciferase protein as the second cistron (Example 1; Chappell et al.,
supra, 2000). B104 cells were transiently co-transfected with the dual luciferase
vector and with a vector expressing β-galactosidase, and luciferase and
β-galactosidase assays were performed (see Example 1). Photinus luciferase activity
values were normalized for transfection efficiency by means of β-galactosidase
activity, and were then normalized to the activity of the RPh parent vector (first
library) or of RPh containing the (Sm/SiOsP oligonucleotide as a spacer control
(second library).
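
The two-step normalization described above can be written out as a short Python
sketch. It assumes that each normalization is a simple ratio, which the text implies
but does not state as a formula; the function name and the example readings are
hypothetical and are not data from this example.

```python
def normalized_photinus_activity(photinus, beta_gal, control_photinus, control_beta_gal):
    """Express Photinus luciferase activity relative to a control construct.

    Step 1: divide each Photinus reading by the beta-galactosidase reading from
            the same transfection to correct for transfection efficiency.
    Step 2: divide the corrected sample value by the corrected control value
            (the parent vector or the spacer control construct).
    """
    if beta_gal <= 0 or control_beta_gal <= 0 or control_photinus <= 0:
        raise ValueError("reference readings must be positive")
    sample = photinus / beta_gal
    control = control_photinus / control_beta_gal
    return sample / control


# Hypothetical readings in arbitrary light units.
print(normalized_photinus_activity(8000.0, 2.0, 1000.0, 2.0))  # 8.0
```

A value near 1 corresponds to background, and larger values indicate enhanced
translation of the second cistron relative to the control.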

Sequences of the oligonucleotide inserts were determined using an ABI
system sequencer (PE Biosystems, Foster City, CA), and were compared using the
Clustal X multiple sequence alignment program (Thompson et al., Nucl. Acids Res.
25:4876-4882, 1997), and with the BestFit program from the Genetics Computer
Group software package (Devereux et al., Nucl. Acids Res. 12:387-395, 1984).
Sequence matches were evaluated by comparing BestFit quality scores to those
obtained when the selected sequences were randomly shuffled 10 times and compared
to 18S rRNA. Secondary structure predictions were made using mfold version 3.0
(Zuker et al., in "RNA Biochemistry and Biotechnology" (ed. Clark; Kluwer
academic publishers 1999), pages 11-43; Mathews et al., J. Mol. Biol. 288:911-940,
1999). Northern blot analysis was performed as described in Example 1 using a
riboprobe encompassing the entire coding region of the Photinus luciferase gene.
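
The shuffling control described above, in which the score of a selected sequence is
compared with scores obtained from randomly shuffled versions of the same sequence,
can be sketched in Python as follows. This is an illustration of the control only:
the BestFit quality score is replaced by a much simpler stand-in (the length of the
longest perfect complementary stretch), and all names and the toy sequences are
hypothetical.

```python
import random

# Watson-Crick pairs only; this stand-in score is NOT the BestFit quality score.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def match_score(query, rrna):
    """Length of the longest stretch of query that pairs perfectly with rrna."""
    target = "".join(COMPLEMENT[base] for base in reversed(rrna))
    for length in range(len(query), 0, -1):
        for start in range(len(query) - length + 1):
            if query[start:start + length] in target:
                return length
    return 0


def shuffled_scores(query, rrna, n_shuffles=10, seed=0):
    """Score n_shuffles random permutations of query against rrna."""
    rng = random.Random(seed)  # seeded so that the control is reproducible
    scores = []
    for _ in range(n_shuffles):
        bases = list(query)
        rng.shuffle(bases)  # keeps base composition, destroys base order
        scores.append(match_score("".join(bases), rrna))
    return scores


def beats_all_shuffles(query, rrna, n_shuffles=10, seed=0):
    """True if the unshuffled score exceeds every shuffled score."""
    return match_score(query, rrna) > max(shuffled_scores(query, rrna, n_shuffles, seed))


# Hypothetical toy sequences, not taken from any SEQ ID NO in this document.
toy_query = "GGAUCCGU"
toy_rrna = "CCCCACGGAUCCCCCC"
print(match_score(toy_query, toy_rrna), shuffled_scores(toy_query, toy_rrna))
```

Shuffling preserves base composition, so a score that stands out from the shuffled
scores reflects the order of the bases rather than their overall frequency.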

The retroviral library containing the random 18 nucleotide inserts was
examined. This library, derived from 2.5 x 10^ retroviral plasmids, was used to infect
approximately 2 x 10^ rat B104 neural tumor cells. After 72 hr, cells that
co-expressed both EGFP and ECFP, corresponding to approximately 0.5% of the
cells, were isolated by FACS. These cells were cultured for 14 days, sorted again by
FACS, and high co-expressors, corresponding to approximately 4% of cells, were
collected and grown. The twice sorted cells were compared to cells that had been
infected with the virus that contained the EMCV IRES between the EGFP and ECFP
genes. Both cell populations showed variable expression, suggesting that IRES
activity can vary among individual cells, perhaps reflecting cell cycle differences in
the population.

Intercistronic sequences contained within the population of twice sorted cells
were isolated by genomic PCR, and cloned into the intercistronic polylinker of the
RPh vector (see Example 1). This dual luciferase vector has a stable hairpin-forming
sequence in the transcribed leader region upstream of the Renilla open reading frame.
The hairpin structure blocks scanning ribosomes and therefore suppresses translation
of the first cistron. Fifty clones were picked at random and plasmid DNA was
prepared, sequenced, and transiently transfected into B104 cells. Of the 45 clones that
were successfully sequenced, 39 contained unique 18 nucleotide inserts. The
sequences of the other 6 clones were each represented more than once, which may
reflect the relatively low complexity of selected sequences in these twice sorted cells.

The sequenced clones were tested in transfected cells and most activities were
weak or at a background level. However, one sequence, designated intercistronic
sequence 1-23 (ICS1-23; SEQ ID NO: 49), demonstrated enhanced Photinus luciferase
activity approximately 8-fold greater than the control constructs. This level of
activity was similar to that observed for one copy of the Gtx IRES module
(Example 1).

A sequence comparison between ICS1-23 (SEQ ID NO: 49) and 18S rRNA
(SEQ ID NO: 51) revealed a complementary match between the 3' end of the IRES
and 18S rRNA at nucleotides 1311-1324 (Figure 5). This match has a BestFit quality
score that is significantly greater than that obtained with 10 randomized variations of
this sequence. To address whether the complementarity to 18S rRNA within ICS1-23 was
associated with the IRES activity, the 30 nucleotide ICS1-23 sequence, which
includes the 18 nucleotide random sequence together with 12 nucleotides of flanking
sequence, was divided into two segments of 15 nucleotides each: ICS1-23a, which
lacked any complementarity to 18S rRNA (see Figure 5), and ICS1-23b (SEQ ID
NO: 50), which contained the complementary match to 18S rRNA (SEQ ID NO: 51).

As disclosed in Example 1, multiple linked copies of the Gtx IRES module
(SEQ ID NO: 2) were more active than the corresponding monomer. Accordingly,
multimers of each segment of ICS1-23 were synthesized, with each repeated segment
separated by nine adenosine nucleotides (poly(A)9). Three linked copies of the
ICS1-23a segment (see Figure 5) did not enhance Photinus luciferase expression. In
contrast, constructs containing three and five linked copies of ICS1-23b (SEQ ID
NO: 51) enhanced Photinus luciferase activity as compared to ICS1-23. These results
indicate that the sequence of ICS1-23 that shares complementarity with 18S rRNA
has IRES activity. Northern blot analysis of RNA from cells expressing the five
linked copies of ICS1-23b revealed a single hybridizing band corresponding in size to
the full length dicistronic mRNA, thus confirming that ICS1-23b did not enhance
Photinus luciferase activity by other mechanisms such as alternative splicing or by
functioning as a promoter.

The second retroviral library, which contained random 9 nucleotide segments
separated by a poly(A)9 spacer in the intercistronic region of the encoded dicistronic
mRNA, was examined in order to identify smaller translational regulatory elements.
Incorporation of the spacer sequence was based on the determination that the Gtx
IRES module (SEQ ID NO: 2), when present in multiple copies separated by the
poly(A)9 spacer, exhibited greater IRES activity than a single copy of the module (see
Example 1).

Approximately 2 x 10^ B104 cells were transduced with the second retroviral
library, which was derived from 1.5 x 10^ retroviral plasmids. Approximately 0.3% of
the cells were selected by FACS, and cultured and sorted a second time.
Approximately 3% of the latter cells were high co-expressors. The oligonucleotide
inserts were recovered by genomic PCR and shotgun cloned into the intercistronic
region of the RPh vector. One hundred clones were picked at random and 84 were
successfully sequenced, yielding 37 different sequences. Fifteen of the sequences
were represented two or more times, indicating that the complexity of the sequences
represented in these twice sorted cells was somewhat lower than that of the first
library. When tested by transient transfection in B104 cells, most sequences enhanced
Photinus luciferase activity weakly (about 2-fold or less above background), and none
were as active as ICS1-23 (SEQ ID NO: 49).

Six of the sequences, which were isolated four or more times from the twice
sorted cells, were examined further. Each of these sequences contained two
9 nucleotide segments, which were tested individually as five linked copies. One of
these constructs, containing a 9 nucleotide segment designated ICS2-17.2
(TCCGGTCGT; SEQ ID NO: 52), showed enhanced Photinus luciferase activity. In
contrast, five linked copies of ICS2-17.1, the other 9 nucleotide segment
contained within selected sequence ICS2-17, did not have IRES activity. RNA
analysis confirmed that a single transcript was produced from the construct, and that
the increase in Photinus luciferase activity was derived from an intact dicistronic
mRNA. These results indicate that ICS2-17.2 (SEQ ID NO: 52) functions as an
IRES.

Five linked copies of both ICS1-23b and ICS2-17.2 also were examined using
the 5' UTR of a monocistronic reporter mRNA. In 7 cell lines tested, (ICS1-23b)5
blocked translation by approximately 70% and (ICS2-17.2)5 slightly enhanced
translation. In both cases, mRNA levels appeared to be unaffected. This result
indicates that ICS1-23b and ICS2-17.2 function as IRES elements in the dicistronic
mRNAs, but not as transcriptional promoters or enhancers. As with ICS1-23b,
sequence comparisons identified a complementary match between ICS2-17.2 and
18S rRNA with a BestFit quality score that is significantly greater than that obtained
with 10 randomized variations of this sequence.
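
The comparison of a selected sequence against randomized variations can be sketched as follows. This is not the BestFit program: BestFit computes a gapped local alignment quality score, whereas this sketch substitutes a much simpler ungapped count of Watson-Crick complementary positions. The target string `toy_target` is a made-up stand-in written in the DNA alphabet, not real 18S rRNA sequence, and the function names are hypothetical.

```python
import random

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA-alphabet sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def best_match(element: str, target: str) -> int:
    """Largest number of complementary positions over all ungapped placements."""
    probe = reverse_complement(element)
    best = 0
    for start in range(len(target) - len(probe) + 1):
        window = target[start:start + len(probe)]
        score = sum(1 for a, b in zip(probe, window) if a == b)
        best = max(best, score)
    return best


def shuffled_scores(element: str, target: str, n: int = 10, seed: int = 0) -> list:
    """Score `n` randomized (shuffled) variations of `element` against `target`."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n):
        bases = list(element)
        rng.shuffle(bases)
        scores.append(best_match("".join(bases), target))
    return scores


# Hypothetical target constructed for illustration only (NOT 18S rRNA).
toy_target = "GGATCAACGACCGGAATTCCGT"
element = "TCCGGTCGT"  # ICS2-17.2, SEQ ID NO: 52

observed = best_match(element, toy_target)
controls = shuffled_scores(element, toy_target)
print("observed:", observed)
print("shuffled controls:", controls)
```

Shuffling preserves base composition, so a selected sequence that scores well above its shuffled variants is showing more than a composition effect. A real analysis would use the actual rRNA sequence and an alignment score that handles gaps.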

The activity of the selected ICS1-23b (SEQ ID NO: 51) and ICS2-17.2 (SEQ
ID NO: 52) IRES modules was examined in additional cell lines to determine whether
they were active in cell types other than the B104 neuroblastoma cells. A construct of
five linked copies of each module was active in each of the cell lines tested, including
rat glioma C6 cells, human neuroblastoma SK cells, mouse neuroblastoma N2a cells,
mouse NIH-3T3 fibroblasts, human cervical carcinoma HeLa cells, normal rat kidney
NRK cells, and mouse muscle myoblast C2C12 cells. The activities of these
synthetic IRESes varied as much as ten-fold between cell lines, and also varied with
respect to each other. However, the pattern of activity of the ICS1-23b (SEQ ID
NO: 51) module in the different cell lines tested was similar to that observed for ten
linked copies of the Gtx IRES module (SEQ ID NO: 2). 

These results demonstrate that relatively small discrete nucleotide sequences 
can act as translational regulatory elements, including as IRES elements, which
mediate cap-independent translation. Furthermore, the two IRES modules identified
in this Example were selected from only a minute sampling of the total complexity of
the random oligonucleotides. Thus, it is likely that screening a more complex library
of random oligonucleotides will identify additional short nucleotide sequences having
IRES or other translational regulatory activity.

It is remarkable that each of the short IRES elements disclosed herein,
including the Gtx IRES (SEQ ID NO: 2), the ICS1-23b IRES (SEQ ID NO: 51), and
the ICS2-17.2 IRES (SEQ ID NO: 52), can promote internal initiation. Each of these
three IRES modules contains a complementary match to a different segment of 18S
rRNA, suggesting that a direct interaction occurs between the IRES module and the
40S ribosomal subunit via base pairing to 18S rRNA. Alternatively, one or more of
the IRES modules may recruit 40S ribosomal subunits by interacting with a protein
component of the translational machinery, for example, a ribosomal protein, an
initiation factor, or some other bridging protein. The ability to initiate translation
internally by binding to an initiation factor has been reported, wherein an iron
response element (IRE) and the bacteriophage λ transcriptional anti-terminator box B
element were both demonstrated to function as IRESes in the presence of fusion
proteins between the appropriate binding protein for these RNA elements and eIF4G
(DeGregorio et al., EMBO J. 18:4865-4874, 1999). However, the lack of appreciable
sequence similarities between the IRES modules disclosed herein and cellular IRESes
in general suggests that a wide variety of nucleotide sequences can function in
internal translation initiation, and suggests that different sequences recruit
pre-initiation complexes by different mechanisms.

The observation that synthetic IRESes comprising multimers of ICS1-23b,
ICS2-17.2, or the Gtx IRES module show enhanced IRES activity as compared to the
corresponding monomers suggests that multiple copies of the IRES module may
increase the probability of recruiting 40S ribosomal subunits. A similar observation
has been made for eIF4G tethered to the IRE-binding protein, where there was an
approximately linear increase in translation when the number of IRE binding sites was
increased from one site to three (DeGregorio et al., supra, 1999).

An arresting feature of cellular IRESes, including the disclosed IRES modules, 
is their variable potency in different cell types. As such, selection for IRESes in a
variety of cell types can provide a means to identify additional elements having
cell-specific and tissue-specific activities. If ribosomal recruitment requires direct
interaction of IRESes with 18S rRNA, variations in efficiency may reflect differences
in the accessibility of particular segments of 18S rRNA in different cell types.
Alternatively, some IRES modules may require or may be blocked by binding
proteins that are differentially expressed in various cell types. Such possibilities can
be distinguished by determining which proteins or components of the translation
machinery bind to particular IRES sequences in various differentiated cells. In view
of the modular nature of cellular IRESes, combinations of synthetic IRESes can be
constructed and elements having desirable regulatory actions can be selected. Such a
combinatorial approach can be used to construct synthetic IRESes having variable
translational regulatory activity, for example, highly restricted or widespread 
translational activity. 

P. Additional Synthetic IRES Elements

A library of synthetic oligonucleotides containing two random 9 nucleotide 
(Ran9) sequences also was examined for IRES activity. The Ran9 oligonucleotides 
were in a construct that contained, from the 5' to the 3' end, 62 nucleotides of the
β-globin leader, a poly(A)9 sequence, the first Ran9 oligonucleotide, a poly(A)9 spacer,
the second Ran9 oligonucleotide, a poly(A)9 sequence, and 44 nucleotides of the
β-globin leader (see, for example, SEQ ID NO: 53, which includes specific nucleotides
corresponding to the two Ran9 positions; see below).

Thirty-six of the constructs that exhibited IRES activity are shown as SEQ ID
NOS: 53 to 88. In addition, the individual IRES modules from the constructs are
shown as SEQ ID NOS: 89 to 160.
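
The 5' to 3' layout of the Ran9 constructs can be sketched in Python as below. The β-globin leader sequences are not reproduced in this passage, so the sketch uses runs of `N` of the stated lengths (62 and 44 nucleotides) as placeholders; the function names are hypothetical, and the random 9-mers generated here are arbitrary examples, not members of the actual library.

```python
import random


def ran9_construct(first: str, second: str,
                   leader5: str = "N" * 62,   # placeholder for 62 nt of leader
                   leader3: str = "N" * 44,   # placeholder for 44 nt of leader
                   spacer: str = "A" * 9) -> str:
    """Assemble leader, poly(A)9, Ran9, poly(A)9, Ran9, poly(A)9, leader."""
    for segment in (first, second):
        if len(segment) != 9 or set(segment) - set("ACGT"):
            raise ValueError("each Ran9 segment must be 9 nt of A, C, G, T")
    return leader5 + spacer + first + spacer + second + spacer + leader3


def random_9mer(rng: random.Random) -> str:
    """Draw an arbitrary 9 nucleotide sequence (illustration only)."""
    return "".join(rng.choice("ACGT") for _ in range(9))


rng = random.Random(1)
construct = ran9_construct(random_9mer(rng), random_9mer(rng))
print(construct)
print(len(construct))  # 62 + 9 + 9 + 9 + 9 + 9 + 44 = 151
```

With two independent 9 nucleotide positions, the number of possible inserts is 4^18, which is one way to see why only a small sampling of the total complexity can be screened in a single library.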

EXAMPLE 5

IDENTIFICATION OF IRES ACTIVITY IN 5' UTRs OF MAMMALIAN mRNAs

IRES elements also were identified in the 5' transcribed leader sequences of 
the mRNAs that encode the rat amyloid precursor protein (APP; Rogers et al., J. Biol.
Chem. 274:6421-6431, 1999, which is incorporated herein by reference) and the
beta-site APP cleaving enzyme (Bace; Yan et al., Nature 402:533-7, 1999, which is
incorporated herein by reference). In addition, IRES activity was associated with the
5' UTR of the human FMR mRNA. These IRES elements were identified and
characterized in the intercistronic region of a dual luciferase dicistronic mRNA, as
disclosed above.

The 5' transcribed leader sequence of the mRNA encoding rat APP is shown 
as SEQ ID NO: 161. The 5' transcribed leader sequence of the mRNA encoding rat
Bace is shown as SEQ ID NO: 162. The 5' untranslated region of the human FMR
mRNA is shown as SEQ ID NO: 163. Using the methods as disclosed in Examples 1
and 6, translational regulatory elements of these 5' UTRs, including IRES modules, 
readily can be identified and isolated. 

EXAMPLE 6 

IDENTIFICATION OF IRES MODULE IN MOUSE Rbm3 5' UTR

IRES activity also was detected in the 728 nucleotide 5' UTR of the mouse 
Rbm3 mRNA (SEQ ID NO: 164, which includes ATG initiator methionine). In 
addition, fragments of the Rbm3 5' UTR had IRES activity, including the following
sequences: nucleotides 1 to 97, nucleotides 199 to 299, nucleotides 299 to 406,
nucleotides 400 to 501, nucleotides 419 to 501, nucleotides 440 to 460, nucleotides 500
to 601, nucleotides 600 to 728, nucleotides 630 to 650, nucleotides 650 to 670 and
nucleotides 690 to 710 of SEQ ID NO: 164.
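
The fragment coordinates listed above are 1-based and inclusive, which differs from the 0-based, end-exclusive slicing used by most programming languages. A minimal Python sketch of the conversion follows. The helper name `fragment` is hypothetical, and `placeholder_utr` is a run of `N` standing in for SEQ ID NO: 164, which is not reproduced here; its length of 728 is an assumption based on the stated size of the 5' UTR.

```python
def fragment(seq: str, start: int, end: int) -> str:
    """Return nucleotides start..end of seq, using 1-based inclusive positions."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Coordinates of the fragments reported to have IRES activity.
RBM3_FRAGMENTS = [
    (1, 97), (199, 299), (299, 406), (400, 501), (419, 501), (440, 460),
    (500, 601), (600, 728), (630, 650), (650, 670), (690, 710),
]

placeholder_utr = "N" * 728  # stand-in only; not the real Rbm3 sequence

for start, end in RBM3_FRAGMENTS:
    piece = fragment(placeholder_utr, start, end)
    print(f"nucleotides {start} to {end}: {len(piece)} nt")
```

The shortest fragments in the list (for example, nucleotides 440 to 460) are 21 nucleotides long under this convention.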

To further characterize the Rbm3 5' UTR, deletions were made from the 5' and 
3' ends and examined using the dicistronic RPh vector (see Example 1). Figure 6 shows
the sequences of the various deletion constructs (SEQ ID NOS: 165 to 190; note that
adenosine residues were added to the 3' end of 3' deletion mutants to maintain relative
distance to the naturally occurring Rbm3 initiator AUG codon) and the IRES activity
associated with each (shown as the R:P ratio, normalized to 1.0 for the activity of the
parent RPh vector). The deletion studies indicated that a core element
comprising the sequence TTTATAATTTCrrTCrrTCCAGAA (SEQ ID NO: 191)
conferred IRES activity. In addition, the sequence GAAAGCTTG (SEQ ID NO: 192),
which is positioned beginning 8 nucleotides downstream of the core IRES element,
enhances the activity of the IRES module by about 10-fold, but does not have IRES
activity on its own. These results demonstrate that a core IRES module is present in the
Rbm3 5' UTR and, together with the results disclosed above, indicate that small IRES
modules can be present in a variety of eukaryotic mRNA molecules.

The deletion studies of the Rbm3 5' UTR also indicated that nucleotide 
sequences in the 5' UTR may inhibit IRES activity. Analysis of the deletion mutants
revealed an Rbm3 IRES inhibitory sequence, GGTACATTTTTlll C (SEQ ID
NO: 193), which is positioned about 21 nucleotides downstream of the IRES module
and inhibited internal initiation of translation facilitated by the nearby IRES module.
In addition, a second inhibitory sequence, ATGCGGAATGATTTATAA (SEQ ID
NO: 194), which partially overlaps the 5' end of the Rbm3 IRES module, contains a
short open reading frame that inhibits internal initiation only when the Rbm3
initiation codon is mutated. This result indicates that a translational inhibitory
element can be masked by an open reading frame.

EXAMPLE 7

IRES MODULES BASED ON RIBOSOMAL RNA STRUCTURE

This example demonstrates that synthetic oligonucleotides having IRES
activity can be designed based on the structure of ribosomal RNA molecules.

It is not presently clear how cellular IRESes function. As disclosed herein,
however, cellular IRESes exist as modular structures composed of short, independent
oligonucleotides, including oligonucleotides that are complementary to 18S rRNA.
These results indicate that recruitment of ribosomal subunits by IRES modules is
directed by base pairing of the IRES element to the rRNA within the ribosomal
subunit.



The 9 nucleotide Gtx IRES module (SEQ ID NO: 2) is 100% complementary 
to an oligonucleotide sequence of 18S rRNA, and was tested as an IRES module
based on this observation. In addition, the ability of the Gtx IRES module (SEQ ID
NO: 2) to recruit 40S ribosomal subunits by base pairing to 18S rRNA was
examined. Nitrocellulose filter-binding and electrophoretic mobility gel shift assays
established a physical link between the 9 nucleotide Gtx IRES module (SEQ ID
NO: 2) and dissociated ribosomal subunits, but not with other components of cell
lysates. Transfection studies using dicistronic constructs that contained the Gtx IRES
module (SEQ ID NO: 2) or mutations of this sequence (see Figure 2) demonstrated
that internal initiation was maximal with a mutant module sharing 7 nucleotides of
complementarity with 18S rRNA, and that as the degree of complementarity was
progressively increased or decreased, IRES activity was decreased and, ultimately,
lost. When tested in the 5' or 3' UTR of a monocistronic mRNA, sequences that
enhanced internal initiation also functioned as translational enhancers. However, only
those sequences with increased complementarity to 18S rRNA inhibited both internal
initiation and translation in monocistronic mRNAs. This inhibition appeared to
involve stable interactions between the mRNA and 40S ribosomal subunits as
determined by polysome analysis. These results indicate that internal initiation of
translation can occur at short nucleotide sequences by base pairing to 18S rRNA.

Sequence analysis of the IRES modules recovered from the selection studies
showed that most of the selected sequences contained complementary sequence
matches of 8 to 9 nucleotides to different regions of the 18S rRNA (Figure 7).
Furthermore, many of the matches are to un-base paired regions of the rRNA (see
Figure 7B). Moreover, in some cases, several selected sequences with slightly
different sequences were complementary to the same region of the 18S rRNA (see,
also, Owens et al., 2001, which is incorporated herein by reference). These results
indicate that synthetic translational regulatory elements can be designed based on
rRNA sequences such as those set forth in SEQ ID NOS: 195-197, particularly to
un-base paired rRNA sequences, which can be predicted using methods as disclosed
herein, such that the synthetic translational regulatory elements are complementary to
a selected rRNA target sequence.
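
One way to picture the design approach described above is to slide a fixed-length window along an rRNA sequence, keep only windows that lie entirely within un-base paired positions, and take the reverse complement of each window as a candidate element. The Python sketch below does this for a toy input. The sequence `toy_rrna` and the mask `toy_unpaired` are invented for illustration and are not taken from SEQ ID NOS: 195-197; how the un-base paired positions are predicted is outside the sketch, and the function names are hypothetical.

```python
RNA_COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement_rna(seq: str) -> str:
    """Return the Watson-Crick reverse complement of an RNA sequence."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(seq))


def candidate_elements(rrna: str, unpaired: list, length: int = 9) -> list:
    """List (1-based start, element) for every fully unpaired window.

    `unpaired[i]` is True when position i of `rrna` is assumed to be
    un-base paired; this mask is supplied by the caller.
    """
    if len(rrna) != len(unpaired):
        raise ValueError("mask must have one entry per nucleotide")
    candidates = []
    for i in range(len(rrna) - length + 1):
        if all(unpaired[i:i + length]):
            candidates.append((i + 1, reverse_complement_rna(rrna[i:i + length])))
    return candidates


# Hypothetical 18 nt target with an assumed 10 nt unpaired stretch.
toy_rrna = "GGCAUCGAUUACGGAUCC"
toy_unpaired = [False] * 4 + [True] * 10 + [False] * 4

candidates = candidate_elements(toy_rrna, toy_unpaired)
for start, element in candidates:
    print(start, element)
```

Only Watson-Crick pairs are considered here. Given the observation that IRES activity was maximal at 7 nucleotides of complementarity for the Gtx module, a fuller treatment would also vary the degree of complementarity rather than always using a perfect 9 nucleotide match.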



Although the invention has been described with reference to the above examples,
it will be understood that modifications and variations are encompassed within the spirit
and scope of the invention. Accordingly, the invention is limited only by the following
claims.

What is claimed is:

1. A synthetic translational regulatory element, comprising at least one
oligonucleotide consisting of about 6 to 125 ribonucleotides, or a deoxyribonucleotide
sequence encoding said oligonucleotide,

wherein said oligonucleotide has a translational regulatory activity selected from
translational enhancing activity, translational inhibitory activity, internal ribosome entry
site (IRES) activity and a combination thereof, and

wherein the synthetic translational regulatory element has translational regulatory
activity in a eukaryotic cell.

2. The synthetic translational regulatory element of claim 1, wherein said 
oligonucleotide consists of about 8 to 100 ribonucleotides. 

3. The synthetic translational regulatory element of claim 1, wherein said
oligonucleotide consists of about 9 to 50 ribonucleotides.

4. The synthetic translational regulatory element of claim 1, wherein said 
oligonucleotide is encoded by a nucleotide sequence selected from any of SEQ ID
NOS: 42 to 46, 49, 50, 52, and 89 to 160.

5. The synthetic translational regulatory element of claim 1, comprising at least 
two of said oligonucleotides, 

wherein the at least two oligonucleotides are operatively linked to each other,
and

wherein each of the at least two oligonucleotides independently is the same or
different from each other.

6. The synthetic translational regulatory element of claim 5, wherein each of the
at least two oligonucleotides is encoded by a nucleotide sequence selected from any of
SEQ ID NOS: 42 to 46, 49, 50, 52, and 89 to 160.

7. The synthetic translational regulatory element of claim 5, comprising five of
said oligonucleotides, which are operatively linked to each other.

8. The synthetic translational regulatory element of claim 5, comprising ten of 
said oligonucleotides, which are operatively linked to each other.

9. The synthetic translational regulatory element of claim 5, comprising 
about 2 to 75 of said oligonucleotides, which are operatively linked to each other. 

10. The synthetic translational regulatory element of claim 5, comprising about
10 to 50 of said oligonucleotides, which are operatively linked to each other.

11. The synthetic translational regulatory element of claim 5, wherein each of
said at least two oligonucleotides is separated from each other by a spacer nucleotide
sequence, wherein said spacer nucleotide sequence consists of about 1 to
100 ribonucleotides. 

12. The synthetic translational regulatory element of claim 5, which is encoded 
by a nucleotide sequence selected from any of SEQ ID NOS: 53 to 88. 

13. The synthetic translational regulatory element of claim 1, which has IRES
activity. 

14. The synthetic translational regulatory element of claim 1, which has
translational enhancing activity.

15. The synthetic translational regulatory element of claim 1, wherein said
oligonucleotide is complementary to an oligonucleotide sequence of a ribosomal RNA. 

16. The synthetic translational regulatory element of claim 15, wherein said
oligonucleotide is derived from an oligonucleotide encoded by any of SEQ ID NOS: 2,
30, 32, 34, 36, 38, 40 and 50.

17. A vector, comprising the synthetic translational regulatory element of
claim 1.

18. The vector of claim 17, which is an expression vector.

19. The expression vector of claim 18, which comprises a translation initiation 
site, wherein the translational regulatory element has translational enhancing activity. 

20. The expression vector of claim 19, further comprising a translation start
codon operatively linked to the translation initiation site.

21. The expression vector of claim 19, further comprising an expressible
polynucleotide,

wherein the translational regulatory element is operatively linked to the
expressible polynucleotide, and

wherein the translational regulatory element has translational enhancing activity
or IRES activity.

22. The expression vector of claim 21, wherein the expressible polynucleotide
comprises at least one cistron.

23. The expression vector of claim 22, wherein the expressible polynucleotide is
polycistronic.

24. A host cell containing the synthetic translational regulatory element of
claim 1.

25. An isolated translational regulatory element, comprising at least one
5' untranslated region (5' UTR) of a eukaryotic messenger RNA (mRNA) or an
oligonucleotide portion thereof, or a deoxyribonucleotide sequence encoding said
5' UTR or oligonucleotide portion thereof,

wherein said 5' UTR or oligonucleotide portion thereof has an activity selected
from translational enhancing activity, internal ribosome entry site (IRES) activity,
translational inhibitory activity, and a combination thereof, and

wherein said translational regulatory element has translational regulatory activity
in a eukaryotic cell.

26. The isolated translational regulatory element of claim 25, 

wherein said 5' UTR is encoded by a nucleotide sequence selected from any of 
SEQ ID NOS: 1, 23 to 29, 161, 162, and 164, and 

wherein said 5' UTR or oligonucleotide portion thereof has an activity selected
from translational enhancing activity and IRES activity.

27. The isolated translational regulatory element of claim 26, wherein said 
oligonucleotide portion is encoded by a nucleotide sequence selected from any of 
nucleotides 1 to 40, 1 to 81, 1 to 120, 41 to 81, 14 to 196, 80 to 120, 80 to 196, 120 to
166, and 120 to 196 of SEQ ID NO: 1.

28. The isolated translational regulatory element of claim 26, wherein said
oligonucleotide portion is encoded by SEQ ID NO: 2. 

29. The isolated translational regulatory element of claim 28, wherein said
oligonucleotide portion encoded by SEQ ID NO: 2 is linked at its 5' end or 3' end or both
5' and 3' ends to a spacer nucleotide sequence, wherein said spacer nucleotide sequence
consists of about 1 to 100 ribonucleotides.

30. The isolated translational regulatory element of claim 29, which is encoded
by a nucleotide sequence selected from SEQ ID NOS: 4, 5, 12, and 20 to 22.

31. The isolated translational regulatory element of claim 25, comprising at least
two 5' UTRs or oligonucleotide portions thereof,

wherein the at least two 5' UTRs or oligonucleotide portions thereof are
operatively linked to each other, and

wherein each of the at least two 5' UTRs or oligonucleotide portions thereof
independently is the same or different from each other.

32. The isolated translational regulatory element of claim 31, comprising at least 
two oligonucleotide portions encoded by SEQ ID NO: 2, which are operatively linked to
each other.

33. The isolated translational regulatory element of claim 31, comprising five
oligonucleotide portions encoded by SEQ ID NO: 2, which are operatively linked to
each other.

34. The isolated translational regulatory element of claim 31, comprising ten
oligonucleotide portions encoded by SEQ ID NO: 2, which are operatively linked to
each other. 

35. The synthetic translational regulatory element of claim 31, comprising about
2 to 75 of said 5' UTRs or oligonucleotide portions thereof, which are operatively linked
to each other. 

36. The synthetic translational regulatory element of claim 31, comprising about
10 to 50 of said 5' UTRs or oligonucleotide portions thereof, which are operatively
linked to each other.

37. The isolated translational regulatory element of claim 31, wherein each of 
the at least two 5' UTRs or oligonucleotide portions thereof is separated from each other
by a spacer nucleotide sequence, wherein said spacer nucleotide sequence consists of
about 1 to 100 ribonucleotides.

38. The isolated translational regulatory element of claim 37, consisting of two
oligonucleotide portions encoded by SEQ ID NO: 2. 

39. The isolated translational regulatory element of claim 38, which is encoded 
by a nucleotide sequence selected from any of SEQ ID NOS: 6 to 11 and 13 to 15.

40. The isolated translational regulatory element of claim 26, wherein said 
oligonucleotide portion is encoded by a nucleotide sequence selected from any of 
nucleotides 1 to 250, 100 to 508, 160 to 508, 250 to 508, 375 to 508, 429 to 508, 481
to 508, and 250 to 390 of SEQ ID NO: 29.

41. The isolated translational regulatory element of claim 26, wherein said
oligonucleotide portion is encoded by a nucleotide sequence selected from any of
SEQ ID NOS: 165 to 169, 171 to 176, and 182 to 190.

42. The isolated translational regulatory element of claim 26, wherein said 
oligonucleotide portion is encoded by a nucleotide sequence selected from SEQ ID 
NOS: 191 and 192. 

43. The isolated translational regulatory element of claim 25,

wherein said 5' UTR is encoded by a nucleotide sequence selected from any of
SEQ ID NOS: 1, 23 to 29, 161, 162, and 164, and 

wherein said oligonucleotide portion thereof has translational inhibitory activity. 

44. The isolated translational regulatory element of claim 43, wherein said
oligonucleotide portion is encoded by a nucleotide sequence selected from
nucleotides 120 to 196 and 167 to 196 of SEQ ID NO: 1. 

45. The isolated translational regulatory element of claim 43, wherein said 
oligonucleotide portion is encoded by a nucleotide sequence selected from
nucleotides 1 to 100 and 100 to 160 of SEQ ID NO: 29.

46. The isolated translational regulatory element of claim 43, wherein said
oligonucleotide portion is encoded by a nucleotide sequence selected from SEQ ID
NO: 193 and SEQ ID NO: 194.



47. The isolated translational regulatory element of claim 25, wherein said
5' UTR or oligonucleotide portion thereof is complementary to an oligonucleotide
sequence of a ribosomal RNA.

48. The isolated translational regulatory element of claim 47, wherein said
oligonucleotide portion is encoded by any of SEQ ID NOS: 2, 30, 32, 34, 36, 38, 40
and 50. 

49. A vector, comprising the translational regulatory element of claim 25.

50. The vector of claim 49, which is an expression vector.

51. The expression vector of claim 49, further comprising an expressible
polynucleotide,

wherein the translational regulatory element is operatively linked to the
expressible polynucleotide, and

wherein the translational regulatory element has transcriptional enhancing
activity or IRES activity.

52. A host cell containing the translational regulatory element of claim 25. 

53. A kit, comprising the synthetic translational regulatory element of claim 1.

54. The kit of claim 53, comprising a plurality of synthetic translational
regulatory elements, which are the same as or different from each other.

55. The kit of claim 54, wherein each of the synthetic translational regulatory
elements in the plurality comprises a flanking sequence independently at a 5' end or a
3' end or both 5' and 3' ends, and

wherein the flanking sequences provide a means to operatively link two or more
synthetic translational regulatory elements in the plurality to each other.

56. A recombinant nucleic acid molecule, comprising the synthetic translational 
regulatory element of claim 1 operatively linked to an expressible polynucleotide. 

57. The recombinant nucleic acid molecule of claim 56, wherein the expressible
polynucleotide comprises a cistron, and wherein the synthetic translational regulatory
element has translational enhancing activity or IRES activity. 



58. The recombinant nucleic acid molecule of claim 56, wherein the expressible
polynucleotide comprises, in operative linkage in a 5' to 3' orientation, a first cistron, a
spacer nucleotide sequence, and a second cistron,

wherein the synthetic translational regulatory element has IRES activity, and
wherein the synthetic translational regulatory element is operatively linked to the
second cistron.

59. The recombinant nucleic acid molecule of claim 56, wherein the synthetic
translational regulatory element comprises at least two operatively linked
oligonucleotides having translational regulatory activity.

60. The recombinant nucleic acid molecule of claim 56, wherein the expressible
polynucleotide encodes at least one polypeptide.

61. The recombinant nucleic acid molecule of claim 60, wherein the at least one
polypeptide is an enzyme.

62. The recombinant nucleic acid molecule of claim 61, wherein the enzyme is
selected from β-galactosidase, β-glucuronidase, luciferase, alkaline phosphatase,
glutathione S-transferase, chloramphenicol acetyltransferase, guanine xanthine
phosphoribosyltransferase, and neomycin phosphotransferase.

63. The recombinant nucleic acid molecule of claim 60, wherein the at least 
one polypeptide is selected from a viral polypeptide and a bacterial polypeptide. 

64. The recombinant nucleic acid molecule of claim 60, wherein the at least one
polypeptide comprises an epitope expressed by a pathogenic organism.

65. The recombinant nucleic acid molecule of claim 60, wherein the at least
one polypeptide is selected from a growth factor, a hormone and a receptor for a 
growth factor or a hormone. 

66. The recombinant nucleic acid molecule of claim 60, wherein the 
expressible polynucleotide encodes two polypeptides. 

67. A method of producing a genetically modified cell that exhibits altered 
expression of a polypeptide, the method comprising introducing a synthetic
translational regulatory element of claim 1 into a cell, whereby the synthetic
translational regulatory element is operatively linked to a nucleotide sequence encoding
a polypeptide, thereby producing a genetically modified cell that exhibits altered
expression of a polypeptide.

68. The method of claim 67, wherein the synthetic translational regulatory 
element is operatively linked to the nucleotide sequence prior to introducing the
synthetic translational regulatory element into the cell, wherein the synthetic
translational regulatory element has translational enhancing activity or IRES activity.

69. The method of claim 67, wherein the polypeptide is a reporter
polypeptide.

70. The method of claim 67, wherein the polypeptide is a toxin.

71. The method of claim 67, wherein the polypeptide is a therapeutic agent.

72. The method of claim 66, wherein the synthetic translational regulatory 
element is stably maintained in the cell. 

73. The method of claim 66, wherein the synthetic translational regulatory 
element is integrated in the cell genome.

74. The method of claim 73, wherein the nucleotide sequence is a sequence of
an endogenous gene in the cell genome. 

75. The method of claim 73, wherein the synthetic translational regulatory
element has translational inhibitory activity.

76. The method of claim 73, wherein the synthetic translational regulatory 
element has translational enhancing activity or IRES activity.

77. A genetically modified eukaryotic cell produced by the method of
claim 73. 

78. A transgenic non-human eukaryotic organism, comprising the genetically 
modified cell of claim 77.

79. A cell or tissue obtained from the transgenic non-human eukaryotic
organism of claim 78. 

80. A cDNA or genomic DNA library prepared from the transgenic
non-human eukaryotic organism of claim 78, or from a cell or tissue obtained from
said transgenic non-human eukaryotic organism.

81. A method of altering translational activity in a eukaryotic cell, the method
comprising introducing into the cell a synthetic translational regulatory element of
claim 1, whereby the synthetic translational regulatory element interacts with a
translation regulatory factor in the cell, thereby altering translational activity in the
eukaryotic cell.

82. A method of altering translational activity in a eukaryotic cell, the method 
comprising introducing into the cell an isolated translational regulatory element of 

claim 25, whereby the isolated translational regulatory element interacts with a
translation regulatory factor in the cell, thereby altering translational activity in the 
eukaryotic cell. 

83. The method of claim 82, wherein the isolated translational regulatory
element comprises an isolated translational regulatory element of claim 25, whereby
translational activity in the cell is decreased.

84. The method of claim 83, wherein the isolated translational regulatory
element comprises a synthetic translational regulatory element of claim 43, whereby
translational activity in the cell is decreased.

85. A method of improving protein yield by a eukaryotic cell, comprising 
introducing into the cell a recombinant nucleic acid molecule comprising a synthetic 
translational regulatory element of claim 1 operatively linked to an expressible 

polynucleotide,

wherein the translational regulatory element has an activity selected from 
translational enhancing activity and IRES activity, and 

wherein the expressible polynucleotide is expressed in the cell. 



86. The method of claim 85, wherein the expressible polynucleotide 
comprises a first cistron encoding a polypeptide that enhances protein stability or cell 
viability. 

87. The method of claim 86, wherein the expressible polynucleotide further 
comprises a second cistron encoding a polypeptide of interest, 

wherein the second cistron is operatively linked to the first cistron, and 
wherein the expressible polynucleotide comprises an IRES element, which is 
operatively linked to the first cistron or the second cistron or both. 

88. The method of claim 86, wherein the polypeptide that enhances protein 
stability or cell viability is selected from a chaperone protein, a heat shock protein, a 
protein having anti-oxidant activity, a protease inhibitor, and a phosphatase inhibitor. 



89. The method of claim 86, wherein the polypeptide that enhances protein 
stability or cell viability is selected from a caspase inhibitor, a magainin, a defensin, 
and a cryptdin. 

90. The method of claim 87, wherein the translational regulatory element 
comprises a plurality of operatively linked oligonucleotides having translational 
enhancing activity or IRES activity or both. 



91. The method of claim 85, wherein the expressible polynucleotide 
comprises a first cistron encoding a polypeptide that enhances transcription or 
translation of a polynucleotide in the cell. 



92. The method of claim 91, wherein the expressible polynucleotide further 
comprises a second cistron encoding a polypeptide of interest, 

wherein the second cistron is operatively linked to the first cistron, 
wherein the expressible polynucleotide comprises an IRES element, which is 
operatively linked to the first cistron or the second cistron or both, and 

wherein the polypeptide that enhances transcription or translation of a 
polynucleotide is a polypeptide that enhances transcription or translation of the 
expressible polynucleotide. 

93. The method of claim 91, wherein the polypeptide that enhances 
transcription or translation of a polynucleotide in the cell is selected from a 
transcription factor and a translation regulatory factor. 

94. The method of claim 93, wherein the polypeptide that enhances 
transcription or translation of a polynucleotide in the cell is a eukaryotic initiation 
factor. 

95. A method of expressing a polypeptide in a eukaryotic cell, comprising: 
a) introducing into the cell a recombinant nucleic acid molecule 
comprising a synthetic translational regulatory element of claim 1 operatively 
linked to an expressible polynucleotide, 

wherein the translational regulatory element has an activity selected 
from translational enhancing activity and IRES activity, and 
b) expressing the expressible polynucleotide in the cell. 

96. The method of claim 95, wherein the expressible polynucleotide 
comprises a first cistron encoding a therapeutic polypeptide. 

97. The method of claim 96, wherein the therapeutic polypeptide is selected 
from an immunomodulator, a neuromodulator, a hormone, a growth factor, a growth 
factor receptor, an apoptotic polypeptide, an anti-apoptotic polypeptide, and an 
antibiotic. 

98. The method of claim 96, wherein the expressible polynucleotide further 
comprises a second cistron encoding a polypeptide that facilitates expression or 
activity of the therapeutic polypeptide, 

wherein the second cistron is operatively linked to the first cistron, and 
wherein the expressible polynucleotide comprises an IRES element, which is 
operatively linked to the first cistron or the second cistron or both. 

99. The method of claim 98, wherein the polypeptide that facilitates expression or activity 
of the therapeutic polypeptide is selected from a transcription factor that increases 
transcription of the expressible polynucleotide, a translational regulatory factor that 
increases translation of a polypeptide encoded by the expressible polynucleotide, a 
chaperone protein, a protein having anti-oxidant activity, a protease inhibitor, and a 
phosphatase inhibitor. 

100. The method of claim 95, wherein introducing the expressible 
polynucleotide into the cell is performed ex vivo or in vivo. 

101. The method of claim 95, wherein the recombinant nucleic acid molecule 
is contained in a vector. 

102. The method of claim 101, wherein the vector is a viral vector. 


103. The method of claim 102, wherein the viral vector is selected from an 
adenovirus vector, an adeno-associated virus vector, and a retrovirus vector. 

104. The method of claim 95, wherein the expressible polynucleotide encodes 
a reporter polypeptide. 

105. The method of claim 104, wherein expression of the reporter polypeptide 
is detectable in a cell ex vivo or in vivo. 

106. The method of claim 104, wherein expression of the reporter polypeptide 
provides a means to diagnose or monitor the progression of a pathologic condition. 

107. A method of identifying a cell, comprising introducing into the cell a 
recombinant nucleic acid molecule comprising a synthetic translational regulatory 
element of claim 1 operatively linked to an expressible polynucleotide, 

wherein the translational regulatory element has an activity selected from 
translational enhancing activity and IRES activity, and 

wherein the expressible polynucleotide comprises at least one cistron, which 
encodes a first reporter polypeptide, and 

wherein expression of the reporter polypeptide in the cell provides a means to 
identify the cell. 

108. The method of claim 107, wherein the expressible polynucleotide further 
comprises a second cistron encoding a polypeptide of interest, 

wherein the second cistron is operatively linked to the first cistron, and 
wherein the expressible polynucleotide comprises an IRES element, which is 
operatively linked to the first cistron or the second cistron or both. 

109. The method of claim 108, wherein the polypeptide of interest is a second 
reporter polypeptide. 

110. The method of claim 109, wherein the second reporter polypeptide is 
different from the first reporter polypeptide. 

111. The method of claim 107, further comprising isolating a cell expressing 
a reporter polypeptide. 

112. The method of claim 111, wherein the reporter polypeptide is a cell 
surface marker, and wherein isolating the cell expressing the cell surface marker 
comprises using an antibody that specifically binds the cell surface marker. 

113. The method of claim 112, wherein the cell surface marker comprises a 
peptide tag operatively linked to a cell surface protein. 

114. An isolated cell obtained by the method of claim 111. 

SEQUENCE LISTING 

<110> THE SCRIPPS RESEARCH INSTITUTE 
THE NEUROSCIENCES INSTITUTE 

<120> SYNTHETIC INTERNAL RIBOSOME ENTRY SITES AND METHODS OF 
IDENTIFYING SAME 

<130> SCRIP1360WO 

<140> HEREWITH 
<141> 2001-01-26 

<150> US 60/230,956 
<151> 2000-09-07 

<150> US 60/230,852 
<151> 2000-09-07 

<150> US 60/207,804 
<151> 2000-05-30 

<150> US 60/186,496 
<151> 2000-03-02 

<150> US 60/178,816 

<151> 2000-01-28 

<150> US (attorney docket SCRIP1370) 

<151> 2001-01-12 

<160> 197 

<170> PatentIn version 3.0 

<210> 1 
<211> 196 
<212> DNA 

<213> Mus musculus 

<400> 1 

ccgccgccca ttcagcgcaa cagccgtcgg tcctctcgct ttcccgtagg ggccgtcggc 60 
gttcgtttga aacgcggtcc acccgtccca gcgtagccgg cgctcttcgg cgccgcgcgc 120 
aaacttcccg agccggcggg tgcgggcggt ggcagcgggg cccggatggg cgcccgggtc 180 
ggaggcggcg gcgccc 196 



<210> 2 

<211> 9 

<212> DNA 

<213> Mus musculus 

<400> 2 
ccggcgggt 
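
Each record in this listing declares a length under <211> and then gives the residues under <400> in blocks of ten with a running position count at the end of each line. A minimal sketch of a consistency check for one record follows. The function names `read_residues` and `check_record` and the permitted alphabet are illustrative assumptions, not part of the listing.

```python
def read_residues(lines):
    """Join the residue blocks of a <400> section, dropping position counts."""
    residues = []
    for line in lines:
        for token in line.split():
            # Tokens made only of digits are running position counts.
            if token.isdigit():
                continue
            residues.append(token.lower())
    return "".join(residues)


def check_record(declared_length, lines, alphabet="acgtun"):
    """Return (sequence, problems) for one record of the listing.

    declared_length is the <211> value; alphabet is an assumed set of
    permitted residue symbols (DNA, RNA, and the unknown symbol n).
    """
    seq = read_residues(lines)
    problems = []
    if len(seq) != declared_length:
        problems.append(f"length {len(seq)} != declared {declared_length}")
    bad = sorted(set(seq) - set(alphabet))
    if bad:
        problems.append(f"unexpected symbols: {''.join(bad)}")
    return seq, problems


# SEQ ID NO: 2 as printed above: <211> 9
seq2, problems2 = check_record(9, ["ccggcgggt"])
print(seq2, problems2)
```

A check of this kind flags lines where a residue block has picked up a stray character or lost a residue, which is useful when a listing has passed through scanning.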
<210> 3 

<211> 51 

<212> DNA 

<213> Mus musculus 

<400> 3 

acatttgctt ctgacatagt tgtgttgact cacaacccca gaaacagaca t 51 



<210> 4 

<211> 51 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> Gtx IRES embedded in beta-globin 5' UTR 

<400> 4 

acatttgctt ctgacatccg gcgggtgact cacaacccca gaaacagaca t 51 
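
Comparing SEQ ID NO: 3 and SEQ ID NO: 4 as printed, the 9 nt element of SEQ ID NO: 2 appears to replace residues 18 to 26 of the 51 nt beta-globin sequence. The sketch below reproduces that substitution. The helper name `embed` and the start position 18 are inferred from the printed sequences and are assumptions, not statements from the listing.

```python
BETA_GLOBIN_5UTR = "".join(
    "acatttgctt ctgacatagt tgtgttgact cacaacccca gaaacagaca t".split()
)  # SEQ ID NO: 3
GTX_ELEMENT = "ccggcgggt"  # SEQ ID NO: 2


def embed(host, element, start):
    """Replace len(element) residues of host, beginning at 1-based position start."""
    i = start - 1
    return host[:i] + element + host[i + len(element):]


# Position 18 is inferred by aligning SEQ ID NO: 3 with SEQ ID NO: 4.
seq4 = embed(BETA_GLOBIN_5UTR, GTX_ELEMENT, 18)
print(seq4, len(seq4))
```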



<210> 5 

<211> 34 

<212> DNA 

<213> ARTIFICIAL 

<220> 
<223> Gtx IRES followed by 25 nt beta-globin 

<400> 5 

ccggcgggtg actcacaacc ccagaaacag acat 34 



<210> 6 
<211> 115 
<212> DNA 
<213> ARTIFICIAL 

<220> 

<223> five repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 

<400> 6 

ttctgacatc cggcgggttt ctgacatccg gcgggtttct gacatccggc gggtttctga 60 
catccggcgg gtttctgaca tccggcgggt gactcacaac cccagaaaca gacat 115 
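
The <223> description of SEQ ID NO: 6 can be checked by assembling the construct from its parts. The sketch below assumes the 9 nt spacer is `ttctgacat` and the 25 nt tail is `gactcacaaccccagaaacagacat`, both read off the printed sequence, and the function name `tandem_construct` is illustrative.

```python
GTX_ELEMENT = "ccggcgggt"                   # SEQ ID NO: 2
SPACER = "ttctgacat"                        # 9 nt spacer, read off SEQ ID NO: 6
GLOBIN_TAIL = "gactcacaaccccagaaacagacat"   # 25 nt of the beta-globin 5' UT


def tandem_construct(element, spacer, repeats, tail):
    """Build `repeats` copies of spacer+element followed by the tail."""
    return (spacer + element) * repeats + tail


seq6 = tandem_construct(GTX_ELEMENT, SPACER, 5, GLOBIN_TAIL)
print(len(seq6))  # declared <211> for SEQ ID NO: 6 is 115
```

Swapping in a different 9 nt spacer with the same element and tail gives the layout described for the other five-repeat constructs in this listing.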



<210> 7 

<211> 202 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> ten repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 

<400> 7 

ccggcgggtt tctgacatcc ggcgggtttc tgacatccgg cgggtttctg acatccggcg 60 
ggtttctgac atccggcggg tgaattcttc tgacatccgg cgggtttctg acatccggcg 120 

ggtttctgac atccggcggg tttctgacat ccggcgggtt tctgacatcc ggcgggtgac 180 

tcacaacccc agaaacagac at 202 



<210> 8 

<211> 115 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> five repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 

<400> 8 

agttgtgttc cggcgggtag ttgtgttccg gcgggtagtt gtgttccggc gggtagttgt 60 
gttccggcgg gtagttgtgt tccggcgggt gactcacaac cccagaaaca gacat 115 

<210> 9 

<211> 202 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> ten repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 

<400> 9 

ccggcgggta gttgtgttcc ggcgggtagt tgtgttccgg cgggtagttg tgttccggcg 60 

ggtagttgtg ttccggcggg tgaattcagt tgtgttccgg cgggtagttg tgttccggcg 120 

ggtagttgtg ttccggcggg tagttgtgtt ccggcgggta gttgtgttcc ggcgggtgac 180 

tcacaacccc agaaacagac at 202 



<210> 10 

<211> 115 

<212> DNA 

<213> ARTIFICIAL 



<220> 

<223> five repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 

<400> 10 

aaaaaaaaac cggcgggtaa aaaaaaaccg gcgggtaaaa aaaaaccggc gggtaaaaaa 60 
aaaccggcgg gtaaaaaaaa accggcgggt gactcacaac cccagaaaca gacat 115 



<210> 11 
<211> 202 
<212> DNA 
<213> ARTIFICIAL 



<220> 

<223> ten repeats of the 9 nt Gtx IRES-element interspersed with 9 nt 
spacers followed by 25 nt of the beta globin 5' UT 



<400> 11 

ccggcgggta aaaaaaaacc ggcgggtaaa aaaaaaccgg cgggtaaaaa aaaaccggcg 60 
ggtaaaaaaa aaccggcggg tgaattcaaa aaaaaaccgg cgggtaaaaa aaaaccggcg 120 
ggtaaaaaaa aaccggcggg taaaaaaaaa ccggcgggta aaaaaaaacc ggcgggtgac 180 
tcacaacccc agaaacagac at 202 



<210> 12 

<211> 106 

<212> DNA 

<213> ARTIFICIAL 



<220> 

<223> eight repeats of the 9 nt spacer, upstream of the 9 nt Gtx IRES-e 
lement, followed by 25 nt of the b-globin 5' UT 

<400> 12 

ttctgacatt tctgacattt ctgacatttc tgacatttct gacatttctg acatttctga 60 
catttctgac atccggcggg tgactcacaa ccccagaaac agacat 106 



<210> 13 

<211> 106 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> six repeats of the 9 nt spacer, upstream of two repeats of the 9 
nt Gtx IRES-element spaced by the 9 nt spacer, followed by the 2 
5 nt of the beta-globin 5' UTR 

<400> 13 

ttctgacatt tctgacattt ctgacatttc tgacatttct gacatttctg acatccggcg 60 
ggtttctgac atccggcggg tgactcacaa ccccagaaac agacat 106 



<210> 14 

<211> 106 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> four repeats of the 9 nt spacer, upstream of two repeats of the 9 
nt Gtx IRES-element spaced 27 nt apart with the 9 nt spacer foll 
owed by the 25 nt beta-globin 5' UT 

<400> 14 
ttctgacatt tctgacattt ctgacatttc tgacatccgg cgggtttctg acatttctga 60 
catttctgac atccggcggg tgactcacaa ccccagaaac agacat 106 

<210> 15 

<211> 106 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> one repeat of the 9 nt spacer, upstream of two repeats of the 9 n 
t Gtx IRES-element spaced 54 nt apart with the 9 nt spacer follow 
ed by the 25 nt beta-globin 5' UT 

<400> 15 

ttctgacatc cggcgggttt ctgacatttc tgacatttct gacatttctg acatttctga 60 
catttctgac atccggcggg tgactcacaa ccccagaaac agacat 106 

<210> 16 

<211> 6250 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> VECTOR 
<400> 16 

gaattctcat gtttgacagc ttatcatcga ttagtccaat ttgttaaaga caggatatca 60 

gtggtccagg ctcagttttg actcaacaat atcaccagct gaagcctata gagtacgagc 120 

catagataga ataaaagatt ttatttagtc tccagaaaaa ggggggaatg aaagacccca 180 

cctgtaggtt tggcaagcta gaaatgtagt cttatgcaat acacttgtag tcttgcaaca 240 

tggtaacgat gagttagcaa catgccttac aaggagagaa aaagcaccgt gcatgccgat 300 

tggtggaagt aaggtggtac gatcgtgcct tattaggaag gcaacagaca ggtctgacat 360 

ggattggacg aaccactcta gagaaccatc agatgtttcc agggtgcccc aaggacctga 420 

aaatgaccct gtgccttatt tgaactaacc aatcagttcg cttctcgctt ctgttcgcgc 480 

gcttctgctc cccgagctca ataaaagagc ccacaacccc tcactcggcg cgccagtcct 540 

ccgattgact gcgtcgcccg ggtacccgta ttcccaataa agcctcttgc tgtttgcatc 600 

cgaatcgtgg actcgctgat ccttgggagg gtctcctcag attgattgac tgcccacctc 660 

ggggtctttc atttggaggt tccaccgaga tttggagacc ccagcccagg gaccaccgac 720 

ccccccgccg ggaggtaagc tggccagcgg tcgtttcgtg tctgtctctg tctttgtgcg 780 

tgtttgtgcc ggcatctaat gtttgcgcct gcgtctgtac tagttagcta actagctctg 840 

tatctggcgg acccgtggtg gaactgacga gttctgaaca cccggccgca accctgggag 900 

acgtcccagg gactttgggg gccgtttttg tggcccgacc tgaggaaggg agtcgatgtg 960 
gaatccgacc ccgtcaggat atgtggttct ggtaggagac gagaacctaa aacagttccc 1020 

gcctccgtct gaatttttgc tttcggtttg gaaccgaagc cgcgcgtctt gtctgctgca 1080 

gcatcgttct gtgttgtctc tgtctgactg tgtttctgta tttgtctgaa aattagggcc 1140 

agactgttac cactccctta agtttgacct taggtcactg gaaagatgtc gagcggatcg 1200 

ctcacaacca gtcggtagat gtcaagaaga gacgttgggt taccttctgc tctgcagaat 1260 

ggccaacctt taacgtcgga tggccgcgag acggcacctt taaccgagac ctcatcaccc 1320 

aggttaagat caaggtcttt cacctggccc gcatggacac ccagaccagg tcccctacat 1380 

cgtgacctgg gaagccttgg cttttgaccc ccctccctgg gtcaagccct ttgtacaccc 1440 

taagcctccg cctcctcttc ctccatccgc cccgtctctc ccccttgaac ctcctcgttc 1500 

gaccccgcct cgatcctccc tttatccagc cctcactcct tctctaggcg ccggaattcg 1560 

ttcatggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg 1620 

gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc 1680 

tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc 1740 

accctcgtga ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg 1800 

aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc 1860 

ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc 1920 

ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg 1980 

cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga caagcagaag 2040 

aacggcatca aggccaactt caagacccgc cacaacatcg aggacggcgg cgtgcagctc 2100 

gccgaccact accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac 2160 

cactacctga gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg 2220 

gtcctgctgg agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag 2280 

taaagcggcc gcgactctag agtcgaggat ccgctagcta gttaattaat cgcgacgacg 2340 

cgtcgccatg gtgagcaagg gcgaggagct gttcaccggg gtggtgccca tcctggtcga 2400 

gctggacggc gacgtaaacg gccacaagtt cagcgtgtcc ggcgagggcg agggcgatgc 2460 

cacctacggc aagctgaccc tgaagttcat ctgcaccacc ggcaagctgc ccgtgccctg 2520 

gcccaccctc gtgaccaccc tgacctgggg cgtgcagtgc ttcagccgct accccgacca 2580 

catgaagcag cacgacttct tcaagtccgc catgcccgaa ggctacgtcc aggagcgcac 2640 

catcttcttc aaggacgacg gcaactacaa gacccgcgcc gaggtgaagt tcgagggcga 2700 

caccctggtg aaccgcatcg agctgaaggg catcgacttc aaggaggacg gcaacatcct 2760 
ggggcacaag ctggagtaca actacatcag ccacaacgtc tatatcaccg ccgacaagca 2820 

gaagaacggc atcaaggcca acttcaagat ccgccacaac atcgaggacg gcagcgtgca 2880 

gctcgccgac cactaccagc agaacacccc catcggcgac ggccccgtgc tgctgcccga 2940 

caaccactac ctgagcaccc agtccgccct gagcaaagac cccaacgaga agcgcgatca 3000 

catggtcctg ctggagttcg tgaccgccgc cgggatcact ctcggcatgg acgagctgta 3060 

caagtaagtc gacggtatcg ataaaataaa agattttatt tagtctccag aaaaaggggg 3120 

gaatgaaaga ccccacctgt aggtttggca agctagaatg cataaatgta gtcttatgca 3180 

atacacttgt agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag 3240 

aaaaagcacc gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga 3300 

aggcaacaga caggtctgac atggattgga cgaaccacta gatctgaagg ggggctataa 3360 

aagcgatgga tccgagctcg gccctcattc tggagactct agaggccttg aattcgcggc 3420 

cgcgccagtc ctccgattga ctgcgtcgcc cgggtaccgt gtatccaata aaccctcttg 3480 

cagttgcatc cgacttgtgg tctcgctgtt ccttgggagg gtctcctctg agtgattgac 3540 

tacccgtcag cgggggtctt tcatttgggg gctcgtccgg gatcgggaga cccctgccca 3600 

gggaccaccg acccaccacc gggaggtaag ctggctgcct cgcgcgtttc ggtgatgacg 3660 

gtgaaaacct ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg 3720 

ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggcgcag 3780 

ccatgaccca gtcacgtagc gatagcggag tgtatactgg cttaactatg cggcatcaga 3840 

gcagattgta ctgagagtgc accatatgtc cgcccatccc gcccctaact ccgcccagtt 3900 

ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg 3960 

cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc ctaggctttt 4020 

gcaacatatg tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 4080 

atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat 4140 

tccagaagta gtgaggaggc ttttttggag gcctaggctt ttgcaacata tgcggtgtga 4200 

aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 4260 

cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 4320 

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 4380 

ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 4440 
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 4500 
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 4560 
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 4620 
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 4680 
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 4740 
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 4800 
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 4860 
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 4920 
[illegible] [illegible] aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 4980 
[illegible] [illegible] aaaaaggatc tcaagaagat cctttgatct tttctacggg 5040 
[illegible] [illegible] aaaactcacg ttaagggatt ttggtcatga gattatcaaa 5100 
[illegible] [illegible] ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 5160 
[illegible] [illegible] acagttacca atgcttaatc agtgaggcac ctatctcagc 5220 
[illegible] [illegible] ccatagttgc ctgactcccc gtcgtgtaga taactacgat 5280 
acgggagggc [illegible] gccccagtgc tgcaatgata ccgcgagacc cacgctcacc 5340 
[illegible] [illegible] taaaccagcc agccggaagg gccgagcgca gaagtggtcc 5400 
[illegible] [illegible] tccagtctat taattgttgc cgggaagcta gagtaagtag 5460 
[illegible] [illegible] gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg 5520 
[illegible] [illegible] cattcagctc cggttcccaa cgatcaaggc gagttacatg 5580 
atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag 5640 
taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt 5700 
catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 5760 
atagtgtatg cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc 5820 
acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc 5880 
aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc 5940 
ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc 6000 
cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 6060 
atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 6120 
ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 6180 
ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 6240 
tcgtcttcaa 6250 

<210> 17 

<211> 117 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> (SI/Gtx133-141) 5beta/RPh 

<400> 17 

ttctgacatc cggcgggatt ctgacatccg gcgggattct gacatccggc gggattctga 60 
catccggcgg gattctgaca tccggcggga gactcacaac cccagaaaca gacatcc 117 

<210> 18 

<211> 117 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> (SI/Gtx133-141) 5beta/RPh 



<400> 18 

ttctgacatc cggcggaatt ctgacatccg gcggaattct gacatccggc ggaattctga 60 
catccggcgg aattctgaca tccggcggaa gactcacaac cccagaaaca gacatcc 117 



<210> 19 
<211> 117 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 19 

ttctgacatc cggcgaaatt ctgacatccg gcgaaattct gacatccggc gaaattctga 60 
catccggcga aattctgaca tccggcgaaa gactcacaac cccagaaaca gacatcc 117 



<210> 20 

<211> 117 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> (SI/Gtx133-141) 5beta/RPh 
<400> 20 

ttctgacatc cggcgggtct ctgacatccg gcgggtctct gacatccggc gggtctctga 60 
catccggcgg gtctctgaca tccggcgggt cactcacaac cccagaaaca gacatcc 117 



<210> 21 

<211> 117 

<212> DNA 

<213> ARTIFICIAL 



<220> 
<223> (SI/Gtx133-141) 5beta/RPh 
<400> 21 

ttctgacatc cggcgggtca ctgacatccg gcgggtcact gacatccggc gggtcactga 60 
catccggcgg gtcactgaca tccggcgggt cactcacaac cccagaaaca gacatcc 117 



<210> 22 

<211> 117 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> (SI/Gtx133-141) 5beta/RPh 
<400> 22 

ttctgacatc cggcgggtca ttgacatccg gcgggtcatt gacatccggc gggtcattga 60 
catccggcgg gtcattgaca tccggcgggt cattcacaac cccagaaaca gacatcc 117 



<210> 23 
<211> 151 
<212> DNA 

<213> Rattus norvegicus 
<400> 23 

cagaagcccc aagctcgtca atcaagctgg ttctccattt gcactcagga gcacgggcag 60 
gcgagtggcc cctagttctg ggagcagagt atcagcatcc cagtcctagt cccgagccta 120 
aagcctcgcc tgcctgccca gtgccaggat g 151 



<210> 24 
<211> 148 
<212> DNA 

<213> Rattus norvegicus 
<400> 24 

gttattctcc gaggggcccg acagcccccg ggagctccag gatgaggagt ccggcagctg 60 
cctctgggtg cagaagtcca agctgttggt gattgaagtg aagactattt cctgtcatta 120 
tagccgtcgc gctgcttctc gacagtcc 148 



<210> 25 

<211> 216 

<212> DNA 

<213> Rattus norvegicus 



<400> 25 

agtgctctgg cgagtagtcc tccctcagcc gcagtctctg ggcctcttca gcttgagcgg 60 

cggcgagcct gccacactcg ctaagctcct ccggcaccgc gcacttgcca ctgccactgc 120 

cgcttcgcgc ccgctgcagc cgccggctct gaatccttct ggcttccgcc tcagaggagt 180 
tcttagcctg tcccgaaccg taaccccggc gagcag 216 



<210> 26 

<211> 367 

<212> DNA 

<213> Rattus norvegicus 

<400> 26 

catcctctct acctccgttc ccttcggtct ttctctttct tctccttctt tccctcgttt 60 
cttcggtcgc cgggtctccc cggctcctcc cacggaggag ctctagctct gagagtctgc 120 
cgtgaccgcg ggtgcgtcca ctttccgtgc ccagattttt attgatccga accaacctat 180 
attgtgactg ctggcaagaa ttaagattct tcagcggggc tctaacagag ggagcgctga 240 
ttgggaagca ttcagtcagg aaattaaaag aaagaagcca gaacatacca ccagcccttt 300 
gcgtatacca cataacaaac gtcattactt tacaacttga ttaggagaca gtacagagat 360 
ctgaagg 367 

<210> 27 
<211> 249 
<212> DNA 

<213> Rattus norvegicus 

<400> 27 

ctccagttct ccccgcccac cctacagaaa gtgtctcctg attggctttg aggccgcagg 60 
gctcaggtta cattcgcaag agttgcggag cgcgggagac cggacccaag aggagagagg 120 
ctggttctgc accgattctg tgctggtccg ggagtgcccg acagcccctg aactaccacc 180 
cagcattgta caaacccacc cccactctga gccaggctcc accccagcca aggaccctca 240 
acaccggca 249 

<210> 28 
<211> 178 
<212> DNA 

<213> Saccharomyces cerevisiae 

<400> 28 

Iccgattaag cacagtacct ttacgttata tataggattg gtgtttagct ttttttcctg 60 
agcccctggt tgacttgtgc atgaacacga gccattttta gtttgtttaa gggaagtttt 120 
ttgccaccca aaacgtttaa agaaggaaaa gttgtttctt aaaccatgag tgtgtcta 178 



<210> 29 

<211> 511 

<212> DNA 

<213> Saccharomyces cerevisiae 

<400> 29 

aatcattttt ttgaaaatta cattaataag gcttttttca atatctctgg aacaacagtt 60 

tgtttctact tactaatagc tttaaggacc ctcttggaca tcatgatggc agacttccat 120 

cgtagtggga tgatcatatg atgggcgcta tcctcatcgc gactcgataa cgacgtgaga 180 

aacgattttt ttttttcttt ttcaccgtat ttttgtgcgt cctttttcaa ttatagcttt 240 

tttttatttt ttttttttct cgtactgttt cactgacaaa agtttttttt caagaaaaat 300 

tttcgatgcc gcgttctctg tgtgcaacgg atggatggta gatggaattt caatatgttg 360 

cttgaaattt fcaccaatctt gatattgtga taatttactt aattatgatt cttcctcttc 420 

ccttcaattt cttaaagctt cttactttac tccttcttgc tcataaataa gcaaggtaag 480 

aggacaactg taattaccta ttacaataat g 511 



<210> 30 
<211> 10 
<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 30 

acgagccauu 10 



<210> 31 
<211> 10 
<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 31 

aauggcucau 10 



<210> 32 

<211> 16 
<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 32 

gaaauuugca aaaccc 16 



<210> 33 
<211> 16 

<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 33 

cuuagaacgu ucuggg 16 



<210> 34 
<211> 13 
<212> RNA 
<213> Saccharomyces cerevisiae 

<400> 34 
cagacuucca ucg 



<210> 35 

<211> 13 

<212> RNA 

<213> Saccharomyces cerevisiae 

<400> 35 
cgauggaagu uug 



<210> 36 
<211> 19 
<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 36 

gcgcuauccu caucgcgac 



<210> 37 
<211> 19 
<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 37 

gucgugcugg ggauagagc 



<210> 38 

<211> 15 

<212> RNA 

<213> Saccharomyces cerevisiae 

<400> 38 
uuaugauucu uccuc 



<210> 39 

<211> 15 

<212> RNA 

<213> ARTIFICIAL 

<220> 

<223> 18s rRNA 

<400> 39 
gcggaaggau cauua 



<210> 40 

<211> 25 

<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 40 

cuucccuuca auuucuuaaa gcuuc 25 



<210> 41 

<211> 25 

<212> RNA 

<213> Saccharomyces cerevisiae 
<400> 41 

gaaacuuaaa ggaauugacg gaagg 25 



<210> 42 

<211> 30 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> IRES-29 

<400> 42 

gatccaggca gaggagagga gggacgagcg 30 



<210> 43 

<211> 30 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> IRES-23 

<400> 43 

gatccagagc aggaacagcg gaaacgagcg 30 



<210> 44 

<211> 30 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> IRES-18 

<400> 44 

gatccaacgt agttaagccg gagccgagcg 30 



<210> 45 

<211> 30 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> IRES-16 

<400> 45 

gatccaggaa ggtgaacgtc caatcgagcg 30 



<210> 46 

<211> 30 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> IRES-15 

<400> 46 

gatccagagg agacgcatag agaacgagcg 



<210> 47 

<211> 42 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> oligonucleotide containing 18 random nucleotides 



<220> 

<221> misc_feature 

<222> (13)..(30) 

<223> n is either a, c, g, or t 



<400> 47 
acgcgtgatc cannnnnnnn nnnnnnnnnn cgagcgacgc gt 42 



<210> 48 

<211> 135 

<212> DNA 

<213> ARTIFICIAL 

<223> oligonucleotide containing two segments of 9 random nucleotides 

<220> 

<221> misc_feature 

<222> (69)..(77) 

<223> n is either a, c, g, or t 



<220> 

<221> misc_feature 

<222> (87)..(95) 

<223> n is either a, c, g, or t 



<400> 48 

ttaattaaga attcttctga cataaaaaaa aattctgaca taaaaaaaaa ttctgacata 60 

aaaaaaaann nnnnnnnaaa aaaaaannnn nnnnnaaaaa aaaagactca caaccccaga 120 

aacagacata cgcgt 135 
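
The <222> feature locations for the random segments can be recomputed from the residues themselves. The sketch below locates each run of `n` and reports it as a 1-based inclusive range, which is the convention the <222> lines appear to use. The function name `random_positions` is illustrative.

```python
import re


def random_positions(seq):
    """Return 1-based inclusive (start, end) ranges for each run of 'n'."""
    return [(m.start() + 1, m.end()) for m in re.finditer(r"n+", seq)]


# Residues of SEQ ID NO: 47 and SEQ ID NO: 48 with block spacing removed.
seq47 = "".join("acgcgtgatc cannnnnnnn nnnnnnnnnn cgagcgacgc gt".split())
seq48 = "".join((
    "ttaattaaga attcttctga cataaaaaaa aattctgaca taaaaaaaaa ttctgacata "
    "aaaaaaaann nnnnnnnaaa aaaaaannnn nnnnnaaaaa aaaagactca caaccccaga "
    "aacagacata cgcgt"
).split())

print(random_positions(seq47))
print(random_positions(seq48))
```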
<210> 49 

<211> 30 

<212> RNA 

<213> ARTIFICIAL 



<220> 

<223> ICS 1-23 a-b 



<400> 49 

gauccagagc aggaacagcg gaaacgagcg 30 



<210> 50 

<211> 15 

<212> RNA 

<213> ARTIFICIAL 

<220> 

<223> ICS 1-23 b 

<400> 50 

cagcggaaac gagcg 15 
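
As printed, SEQ ID NO: 49 matches the RNA form of the IRES-23 DNA sequence (SEQ ID NO: 43), and SEQ ID NO: 50 matches its last 15 residues. The sketch below checks both relationships. Reading the "a-b" and "b" labels as the full sequence and its second half is an inference from the printed residues, and the function name `transcribe` is illustrative.

```python
def transcribe(dna):
    """Write a DNA sequence in RNA letters (t becomes u)."""
    return dna.replace("t", "u")


IRES_23 = "".join("gatccagagc aggaacagcg gaaacgagcg".split())  # SEQ ID NO: 43

ics_ab = transcribe(IRES_23)  # compare with SEQ ID NO: 49
ics_b = ics_ab[15:]           # residues 16 to 30; compare with SEQ ID NO: 50

print(ics_ab)
print(ics_b)
```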



<210> 51 
<211> 14 
<212> RNA 

<213> Saccharomyces cerevisiae 

<400> 51 

uucucgauuc cgug 14 



<210> 52 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> 9 nt segment designated as ICA2-17.2 

<400> 52 

tccggtcgt 9 



<210> 53 
<211> 148 
<212> DNA 
<213> ARTIFICIAL 
<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 53 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattct 60 
gacataaaaa aaaagggttg tacaaaaaaa aacccagttt caaaaaaaaa gactcacaac 120 
cccagaaaca gacatacgcg tcgccatg 148 

<210> 54 
<211> 148 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 



<400> 54 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattct 60 
gacataaaaa aaaaatgatt tgtaaaaaaa aagtgtcgat gaaaaaaaaa gactcacaac 120 
cccagaaaca gacatacgcg tcgccatg 148 



<210> 55 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 55 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaacacatcc cgaaaaaaaa attatatatc aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 

<210> 56 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 56 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaaaccctgc tcaaaaaaaa agcgtggtag aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 57 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 
<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 57 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaaatccggg gtaaaaaaaa attaaataaa aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 58 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 
<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 58 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaaatcttaa agaaaaaaaa atgcagcgcg aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 59 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 
<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 59 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaatatcgtc ttaaaaaaaa aggttgcact aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 60 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 
<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 60 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaattggtaa agaaaaaaaa agaagacccg aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 

<210> 61 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 61 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaagcatgag taaaaaaaaa agctcaccta aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 

<210> 62 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 62 ^- 
gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaatcggacg ttaaaaaaaa atccggtcgt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147

<210> 63 

<211> 147 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 63 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaatttactg ctaaaaaaaa atggctgttc aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147



<210> 64 
<211> 147 
<212> DNA

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 64 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaattaagta gcaaaaaaaa attgtttagt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 

<210> 65 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 65 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaaatcttgc gtaaaaaaaa agttcctgcg aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 

<210> 66 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT 

<400> 66 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaaagcatga gtaaaaaaaa aagctcacct aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 67 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 67 
gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60

acataaaaaa aaagcagtta ataaaaaaaa agacatcagc aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147

<210> 68 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT



<400> 68 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60
acataaaaaa aaaaaggctg ccaaaaaaaa aaacgtttag aaaaaaaaag actcacaacc 120
ccagaaacag acatacgcgt cgccatg 147

<210> 69 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 69 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaacttggct caaaaaaaaa aagttcctgt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147

<210> 70 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 70
gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaattcaaca tgaaaaaaaa aagcgtctat aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147



<210> 71 
<211> 147
<212> DNA 
<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 71 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaatcggccg ttaaaaaaaa acccggtggt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 72 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 72 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaagcgcgtt agaaaaaaaa acctgccttc aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 73 

<211> 145 

<212> DNA 

<213> ARTIFICIAL 

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT

<400> 73 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaacttggct caaaaaaaaa gttcctgtaa aaaaaaagac tcacaacccc 120 

agaaacagac atacgcgtcg ccatg 145 



<210> 74 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT 
<400> 74

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaacgttgat tcaaaaaaaa aacgacctcc aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147

<210> 75 
<211> 146 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 75

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaagtcactg ccaaaaaaaa atcgatatta aaaaaaaaga ctcacaaccc 120 
cagaaacaga catacgcgtc gccatg 146

<210> 76 
<211> 147 
<212> DNA 
<213> ARTIFICIAL

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 76 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaagagagtg agaaaaaaaa atgtcctatt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147

<210> 77 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 77
gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaacgttgcg tcaaaaaaaa acgcccggcc aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147

<210> 78
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 78 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaacgcccgg gtaaaaaaaa acatggcggt aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 79 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 79 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaacaatttc gtaaaaaaaa agtccgggac aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 80 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 80 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaaagatagc ctaaaaaaaa attgcttatg aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147 



<210> 81 

<211> 145 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 
<400> 81

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60

acataaaaaa aaaccaggcg gaaaaaaaaa ctacttccaa aaaaaaagac tcacaacccc 120

agaaacagac atacgcgtcg ccatg 145

<210> 82 

<211> 147 

<212> DNA 

<213> ARTIFICIAL

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT

<400> 82
gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaataatcgt agaaaaaaaa aagcaggatg aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147

<210> 83 
<211> 147 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT

<400> 83 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaatggcatg agaaaaaaaa atgggcggct aaaaaaaaag actcacaacc 120 

ccagaaacag acatacgcgt cgccatg 147



<210> 84 
<211> 146 
<212> DNA 
<213> ARTIFICIAL 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT 

<400> 84 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 

acataaaaaa aaattcaaca tgaaaaaaaa agcgtctata aaaaaaaaga ctcacaaccc 120 

cagaaacaga catacgcgtc gccatg 146
<210> 85
<211> 147
<212> DNA
<213> ARTIFICIAL

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT

<400> 85

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaacggcgcc agaaaaaaaa aatgtgtggt aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 86
<211> 147
<212> DNA
<213> ARTIFICIAL

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT

<400> 86



gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaagtcatat caaaaaaaaa acggtgagcg aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 87
<211> 147
<212> DNA
<213> ARTIFICIAL

<220>

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b
eta-globin 5' UT

<400> 87

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaaggcggcg gaaaaaaaaa atcgtttgcc aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147 



<210> 88 

<211> 147 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> two random 9 nt sequences flanked by poly (a) 9 and embedded in b 
eta-globin 5' UT
<400> 88 

gctagcttaa ttaagaattc ttctgacata aaaaaaaatt ctgacataaa aaaaaattcg 60 
acataaaaaa aaagtaaccc taaaaaaaaa aagcccgctt aaaaaaaaag actcacaacc 120 
ccagaaacag acatacgcgt cgccatg 147



<210> 89 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 89 
gggttgtac 



<210> 90 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 90 
cccagtttc 



<210> 91 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 91 
atgatttgt 



<210> 92 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 92 
gtgtcgatg 



<210> 93 
<211> 9

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence

<400> 93 
cacatcccg 



<210> 94 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 94 
ttatatatc 



<210> 95 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 95 
accctgctc 



<210> 96 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 96 
gcgtggtag 



<210> 97 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 97 
atccggggt 
<210> 98

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 98 
ttaaataaa 



<210> 99 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 99 
atcttaaag 



<210> 100 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 100 
tgcagcgcg 



<210> 101 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 101 
tatcgtctt 



<210> 102 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 102 
ggttgcact 
<210> 103

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 103 
ttggtaaag 



<210> 104 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 104 
gaagacccg 



<210> 105 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 105 
gcatgagta 



<210> 106 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 106 
gctcaccta 



<210> 107 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 107 
tcggacgtt 
<210> 108

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 108 
tccggtcgt 



<210> 109 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 109 
tttactgct 



<210> 110 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 110 
tggctgttc 



<210> 111 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 111 
ttaagtagc 



<210> 112 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 



<400> 112 
ttgtttagt



<210> 113 

<211> 9 

<212> DNA

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 113 
atcttgcgt 



<210> 114 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 114 
gttcctgcg 



<210> 115 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 115 

agcatgagt 9 



<210> 116 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 116 

agctcacct 9 



<210> 117 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 



<220> 

<223> random 9 nt sequence 
<400> 117
gcagttaat 



<210> 118 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 118 
gacatcagc 

<210> 119 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 119 
aaggctgcc 



<210> 120 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 120 
aacgtttag 



<210> 121 

<211> 9 

<212> DNA 

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 121
cttggctca 



<210> 122 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 
<400> 122

agttcctgt 



<210> 123

<211> 9 

<212> DNA 

<213> ARTIFICIAL 



<220> 

<223> random 9 nt sequence 

<400> 123 
ttcaacatg 



<210> 124 

<211> 8 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 124 

gcgtctat 8 



<210> 125 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 



<220> 

<223> random 9 nt sequence 
<400> 125 

tcggccgtt 9



<210> 126 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 126 

cccggtggt 9 



<210> 127 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 



<220> 
<223> random 9 nt sequence

<400> 127 
gcgcgttag 

<210> 128 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 128 
cctgccttc 



<210> 129 

<211> 8 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 129 
cttggctc 



<210> 130 

<211> 8 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 130 
gttcctgt 



<210> 131 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 131 
cgttgattc 



<210> 132 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 
<220>

<223> random 9 nt sequence 
<400> 132

acgacctcc 9 



<210> 133 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 133 

gtcactgcc 9 



<210> 134 

<211> 8 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 134 

tcgatatt 8



<210> 135 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 135 

gagagtgag 9 



<210> 136 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 136 

tgtcctatt 9 



<210> 137 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 
<220>

<223> random 9 nt sequence 

<400> 137 
cgttgcgtc 



<210> 138 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 138 
cgcccggcc 



<210> 139 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 139 
cgcccgggt 



<210> 140 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 140 
catggcggt 



<210> 141 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 141 
caatttcgt 



<210> 142 
<211> 9 
<212> DNA 
<213> ARTIFICIAL
<220> 

<223> random 9 nt sequence 
<400> 142

gtccgggac 9 



<210> 143 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 143 

agatagcct 9



<210> 144 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 144 

ttgcttatg 9 



<210> 145 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 145 

ccaggcgga 9 

<210> 146 

<211> 8 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 146 

ctacttcc 8 



<210> 147 
<211> 9 
<212> DNA

<213> ARTIFICIAL

<220> 

<223> random 9 nt sequence 

<400> 147 
taatcgtag 



<210> 148 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 148 
agcaggatg 



<210> 149 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 149 
tggcatgag 



<210> 150 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 150 
tgggcggct 



<210> 151 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 151 
ttcaacatg 



<210> 152 
<211> 8

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 152 
gcgtctat 



<210> 153 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 


<400> 153 

cggcgccag 

<210> 154 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 154 
atgtgtggt 



<210> 155 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 155 
gtcatatca 



<210> 156 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 156 
cggtgagcg 
<210> 157
<211> 9 
<212> DNA 
<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 157 
ggcggcgga 



<210> 158 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence 

<400> 158 

tcgtttgcc 9 



<210> 159 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 159 

gtaacccta 9 



<210> 160 

<211> 9 

<212> DNA 

<213> ARTIFICIAL 

<220> 

<223> random 9 nt sequence

<400> 160 

agcccgctt 9 



<210> 161 

<211> 156 

<212> DNA 

<213> Rattus norvegicus 



<400> 161 

agtttcctcg gcggcgggag gcgagagcac taggagcaga gagagcgcgg gggccaccgg 60 
agacggcggc ggcggcggcg acgacgcgga cacagtcagg gtgcggcgga tcttccactt 120 



gcacacggag cagtcggtgg cccacgcagg atcacg 156
<210> 162

<211> 427 

<212> DNA

<213> Rattus norvegicus 

<400> 162 

ccccagcctg cctaggtgct gggagccggg agctggatta tggtggcctg agcagccgac 60 

gcagccgcag gagctgggag tccctcacgc tgcaaagtcc gcctggaaga ccctgaaagc 120 

tgcaggctcc gatagccatg cccgcccctc ccagccccac aaggggcccg atccccccgc 180 

tgaggctggc ggtcgccgtc cagatgtagc tgggtccccc ggatcgccat cgtcctcttc 240 

tctcgtgcgc tacagatttc tcctgcccac tctccaccgc cgggagcagg aactgagcga 300 

ggggcctgca gactctgcag tcctgatgcc cccgaggccg ctctcctgag agaagccacc 360 

accacccaga cttaggggca ggcaagaggg acagtcgcca accggagcca caaggcccgg 420 

gctcacc 427 



<210> 163 

<211> 321 

<212> DNA 

<213> Homo sapiens 

<400> 163

ccgcgggggg agggaacagc gttgatcacg tgacgtggtt tcagtgttta cacccgcagc 60
gggccggggg ttcggcctca gtcaggcgct cagctccgtt tcggtttcac ttccggtgga 120
gggccgcctc tgagcgggcg gcgggccgac ggcgagcgcg ggcggcggcg gtgacggagg 180
cgccgctgcc agggggcgtg cggcagcgcg gcggcggcgg cggcggcggc ggcggcggag 240
gcggcggcgg cggcggcggc ggcggcggct gggcctcgag cgcccgcagc ccacctctcg 300
ggggcgggct cccggcgcta g 321


<210> 164 
<211> 723 
<212> DNA 
<213> Mus musculus

<400> 164

cacgaggcat tacccagtag actcttacac tgaacctccc aatcctaact tagtacatgc 60

atacaataaa tgctcactaa gtactgatat atgattaaaa cacttgaaaa caaacaaact 120

gtaatgtaca tgacataatc gtgatgagca gtaagtagga acatcttctt ggcaaaggaa 180

ggagtcaaac aatgagcagg ggctgtgagt gtgggtcagt ggtagcacag catgcatgag 240

gccctatatt tggtctctag caacacacat aaaagatgaa caaaaccaga cagaggaaga 300

ggtatgtcga gagtcttaaa taaaacaggt cagatctaaa acaactttta ggagaaacag 360

aaagatactt actttcggtc actcccccaa tagcaagaga aataatagct aaaacgttct 420 

cacatgcgga atgatttata atttcttctt ccagaacacc tctgaaagct tggtcaaggg 480 

tacatttttt ttcattttca ctgccaggta actgactgaa ggcagtcaac aacggcttga 540 

tatttttgtt attcaaggct tctctggtag atttcgtaaa tcgcgtccgc gcctccggca 600 

aactgaaaag cgctttatca ggcgtcttcc cgcgccgcag tctctctgtt ctcccggttc 660 

cttcgagctc gtcgtctctg ccgtcctctg acttttaatt tccaggactt gccttctgcc 720 

atg 723 



<210> 165 

<211> 83 

<212> DNA 

<213> Mus musculus 

<400> 165 

aaaacgttct cacatgcgga atgatttata atttcttctt ccagaacacc tctgaaagct 60 
tggtcaaggg tacatttttt ttc 83 



<210> 166 

<211> 80 

<212> DNA 

<213> Mus musculus 

<400> 166 

acgttctcac atgcggaatg atttataatt tcttcttcca gaacacctct gaaagcttgg 60 
tcaagggtac attttttttc 80 



<210> 167 

<211> 77 

<212> DNA 

<213> Mus musculus 

<400> 167 

ttctcacatg cggaatgatt tataatttct tcttccagaa cacctctgaa agcttggtca 60 
agggtacatt ttttttc 77 



<210> 168 

<211> 74 

<212> DNA 

<213> Mus musculus 



<400> 168 

tcacatgcgg aatgatttat aatttcttct tccagaacac ctctgaaagc ttggtcaagg 60 
gtacattttt tttc 74



<210> 169 

<211> 71 

<212> DNA

<213> Mus musculus 

<400> 169 

catgcggaat gatttataat ttcttcttcc agaacacctc tgaaagcttg gtcaagggta 60 
catttttttt c 71 



<210> 170 

<211> 69 

<212> DNA 

<213> Mus musculus 

<400> 170 

tgcggaatga tttataattt cttcttccag aacacctctg aaagcttggt caagggtaca 60 
ttttttttc 69



<210> 171 

<211> 65 

<212> DNA 

<213> Mus musculus 



<400> 171 

gaatgattta taatttcttc ttccagaaca cctctgaaag cttggtcaag ggtacatttt 60 
ttttc 65 



<210> 172 

<211> 62 

<212> DNA 

<213> Mus musculus 

<400> 172 

tgatttataa tttcttcttc cagaacacct ctgaaagctt ggtcaagggt acattttttt 60 
tc 62 



<210> 173 

<211> 59 

<212> DNA 

<213> Mus musculus 



<400> 173 

tttataattt cttcttccag aacacctctg aaagcttggt caagggtaca ttttttttc 59 



<210> 174 
<211> 58 
<212> DNA 
<213> Mus musculus
<400> 174 

ttataatttc ttcttccaga acacctctga aagcttggtc aagggtacat tttttttc 58 



<210> 175 

<211> 57 

<212> DNA 

<213> Mus musculus 

<400> 175 

tataatttct tcttccagaa cacctctgaa agcttggtca agggtacatt ttttttc 57 



<210> 176 

<211> 53 

<212> DNA 

<213> Mus musculus 

<400> 176 

atttcttctt ccagaacacc tctgaaagct tggtcaaggg tacatttttt ttc 53 



<210> 177 

<211> 50 

<212> DNA 

<213> Mus musculus 

<400> 177 

tcttcttcca gaacacctct gaaagcttgg tcaagggtac attttttttc 50



<210> 178 

<211> 47 

<212> DNA 

<213> Mus musculus 

<400> 178 

tcttccagaa cacctctgaa agcttggtca agggtacatt ttttttc 47 



<210> 179 

<211> 44 

<212> DNA 

<213> Mus musculus 

<400> 179 

tccagaacac ctctgaaagc ttggtcaagg gtacattttt tttc 44 



<210> 180 

<211> 42 

<212> DNA 

<213> Mus musculus 



<400> 180 

cagaacacct ctgaaagctt ggtcaagggt acattttttt tc 42
<210> 181

<211> 62

<212> DNA

<213> Mus musculus

<400> 181

tgatttataa tttcttcttc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60
aa 62


<210> 182

<211> 62

<212> DNA

<213> Mus musculus

<400> 182

tgatttataa tttcttcttc cagaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60
aa 62


<210> 183

<211> 62

<212> DNA

<213> Mus musculus

<400> 183





tgatttataa tttcttcttc cagaacacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60 
aa 62 



<210> 184 

<211> 62 

<212> DNA 

<213> Mus musculus 

<400> 184 

tgatttataa tttcttcttc cagaacacct caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60 
aa 62 



<210> 185

<211> 62

<212> DNA

<213> Mus musculus

<400> 185

tgatttataa tttcttcttc cagaacacct ctgaaaaaaa aaaaaaaaaa aaaaaaaaaa 60
aa 62



<210> 186 
<211> 62 
<212> DNA 
<213> Mus musculus
<400> 186 

tgatttataa tttcttcttc cagaacacct ctgaaagcta aaaaaaaaaa aaaaaaaaaa 60 
aa 62 



<210> 187 

<211> 62 

<212> DNA 

<213> Mus musculus 

<400> 187 

tgatttataa tttcttcttc cagaacacct ctgaaagctt aaaaaaaaaa aaaaaaaaaa 60
aa 62



<210> 188

<211> 62

<212> DNA

<213> Mus musculus



<400> 188 

tgatttataa tttcttcttc cagaacacct ctgaaagctt gaaaaaaaaa aaaaaaaaaa 60
aa 62



<210> 189

<211> 62

<212> DNA

<213> Mus musculus



<400> 189 

tgatttataa tttcttcttc cagaacacct ctgaaagctt ggaaaaaaaa aaaaaaaaaa 60 
aa 62 



<210> 190 

<211> 62 

<212> DNA

<213> Mus musculus 

<400> 190 

tgatttataa tttcttcttc cagaacacct ctgaaagctt ggtcaagaaa aaaaaaaaaa 60
aa 62



<210> 191

<211> 22

<212> DNA

<213> Mus musculus



<400> 191 

tttataattt caacaaccag aa 22
<210> 192
<211> 9
<212> DNA
<213> Mus musculus

<400> 192
gaaagcttg 9


<210> 193 
<211> 15 
<212> DNA 
<213> Mus musculus

<400> 193
ggtacatttt ttttc 15


<210> 194 
<211> 19 
<212> DNA 
<213> Mus musculus

<400> 194
atgcgggaat gatttataa 19


<210> 195 
<211> 1798 
<212> DNA 

<213> Saccharomyces cerevisiae 




<400> 195 
tatctggttg atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct 60
aagtataagc aatttataca gtgaaactgc gaatggctca ttaaatcagt tatcgtttat 120
ttgatagttc ctttactaca tggtataacc gtggtaattc tagagctaat acatgcttaa 180
aatctcgacc ctttggaaga gatgtattta ttagataaaa aatcaatgtc ttcgcactct 240
ttgatgattc ataataactt ttcgaatcgc atggccttgt gctggcgatg gttcattcaa 300
atttctgccc tatcaacttt cgatggtagg atagtggcct accatggttt caacgggtaa 360
cggggaataa gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga 420
aggcagcagg cgcgcaaatt acccaatcct aattcaggga ggtagtgaca ataaataacg 480
atacagggcc cattcgggtc ttgtaattgg aatgagtaca atgtaaatac cttaacgagg 540
aacaattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat 600
attaaagttg ttgcagttaa aaagctcgta gttgaacttt gggcccggtt ggccggtccg 660
attttttcgt gtactggatt tccaacgggg cctttccttc tggctaacct tgagtccttg 720
tggctcttgg cgaaccagga cttttacttt gaaaaaatta gagtgttcaa agcaggcgta 780
ttgctcgaat atattagcat ggaataatag aataggacgt ttggttctat tttgttggtt 840
tctaggacca tcgtaatgat taatagggac ggtcgggggc atcggtattc aattgtcgag 900
gtgaaattct tggatttatt gaagactaac tactgcgaaa gcatttgcca aggacgtttt 960
cattaatcaa gaacgaaagt taggggatcg aagatgatct ggtaccgtcg tagtcttaac 1020
cataaactat gccgactaga tcgggtggtg tttttttaat gacccactcg gtaccttacg 1080
agaaatcaaa gtctttgggt tctgggggga gtatggtcgc aaggctgaaa cttaaaggaa 1140
ttgacggaag ggcaccacta ggagtggagc ctgcggctaa tttgactcaa cacggggaaa 1200
ctcaccaggt ccagacacaa taaggattga cagattgaga gctctttctt gattttgtgg 1260
gtggtggtgc atggccgttt ctcagttggt ggagtgattt gtctgcttaa ttgcgataac 1320
gaacgagacc ttaacctact aaatagtggt gctagcattt gctggttatc cacttcttag 1380
agggactatc ggtttcaagc cgatggaagt ttgaggcaat aacaggtctg tgatgccctt 1440
agaacgttct gggccgcacg cgcgctacac tgacggagcc agcgagtcta accttggccg 1500
agaggtcttg gtaatcttgt gaaactccgt cgtgctgggg atagagcatt gtaattattg 1560
ctcttcaacg aggaattcct agtaagcgca agtcatcagc ttgcgttgat tacgtccctg 1620
ccctttgtac acaccgcccg tcgctagtac cgattgaatg gcttagtgag gcctcaggat 1680
ctgcttagag aagggggcaa ctccatctca gagcggagaa tttggacaaa cttggtcatt 1740
tagaggaact aaaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcatta 1798



<210> 196

<211> 1869

<212> DNA

<213> Mus musculus

<400> 196

tacctggttg atcctgccag tagcatatgc ttgtctcaaa gattaagcca tgcatgtcta 60
agtacgcacg gccggtacag tgaaactgcg aatggctcat taaatcagtt atggttcctt 120
tggtcgctcg ctcctctcct acttggataa ctgtggtaat tctagagcta atacatgccg 180
acgggcgctg accccccttc ccgggggggg atgcgtgcat ttatcagatc aaaaccaacc 240
cggtgagctc cctcccggct ccggccgggg gtcgggcgcc ggcggcttgg tgactctaga 300
taacctcggg ccgatcgcac gccccccgtg gcggcgacga cccattcgaa cgtctgccct 360
atcaactttc gatggtagtc gccgtgccta ccatggtgac cacgggtgac ggggaatcag 420
ggttcgattc cggagaggga gcctgagaaa cggctaccac atccaaggaa ggcagcaggc 480
gcgcaaatta cccactcccg acccggggag gtagtgacga aaaataacaa tacaggactc 540
tttcgaggcc ctgtaattgg aatgagtcca ctttaaatcc tttaacgagg atccattgga 600
gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat attaaagttg 660
ctgcagttaa aaagctcgta gttggatctt gggagcgggc gggcggtccg ccgcgaggcg 720
agtcaccgcc cgtccccgcc ccttgcctct cggcgccccc tcgatgctct tagctgagtg 780
tcccgcgggg cccgaagcgt ttactttgaa aaaattagag tgttcaaagc aggcccgagc 840
cgcctggata ccgcagctag gaataatgga ataggaccgc ggttctattt tgttggtttt 900
cggaactgag gccatgatta agagggacgg ccgggggcat tcgtattgcg ccgctagagg 960
tgaaattctt ggaccggcgc aagacggacc agagcgaaag catttgccaa gaatgttttc 1020
attaatcaag aacgaaagtc ggaggttcga agacgatcag ataccgtcgt agttccgacc 1080
ataaacgatg ccgactggcg atgcggcggc gttattccca tgacccgccg ggcagcttcc 1140
gggaaaccaa agtctttggg ttccgggggg agtatggttg caaagctgaa acttaaagga 1200
attgacggaa gggcaccacc aggagtgggc ctgcggctta atttgactca acacgggaaa 1260
cctcacccgg cccggacacg gacaggattg acagattgat agctctttct cgattccgtg 1320
ggtggtggtg catggccgtt cttagttggt ggagcgattt gtctggttaa ttccgataac 1380
gaacgagact ctggcatgct aactagttac gcgacccccg agcggtcggc gtcccccaac 1440
ttcttagagg gacaagtggc gttcagccac ccgagattga gcaataacag gtctgtgatg 1500
cccttagatg tccggggctg cacgcgcgct acactgactg gctcagcgtg tgcctaccct 1560
gcgccggcag gcgcgggtaa cccgttgaac cccattcgtg atggggatcg gggattgcaa 1620
ttattcccca tgaacgagga attcccagta agtgcgggtc ataagcttgc gttgattaag 1680
tccctgccct ttgtacacac cgcccgtcgc tactaccgat tggatggttt agtgaggccc 1740
tcggatcggc cccgccgggg tcggcccacg gccctggcgg agcgctgaga agacggtcga 1800
acttgactat ctagaggaag taaaagtcgt aacaaggttt ccgtaggtga acctgcggaa 1860
ggatcatta 1869



<210> 197 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220>
<221> modified_base
<222> (27)..(27)
<223> m2a--2'-o-methyladenosine (genebank # 36162)

<220>
<221> modified_base
<222> (99)..(99)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (159)..(159)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (166)..(166)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (468)..(468)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (484)..(484)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (512)..(512)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (576)..(576)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (590)..(590)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (668)..(668)
<223> m2a--2'-o-methyladenosine

<220>
<221> modified_base
<222> (1031)..(1031)
<223> m2a--2'-o-methyladenosine
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<220> 

<221> modlf ied_base 

<222> (1383) . . (1383) 

<223> m2a--2 * -o-methyladenosine 



<220> 

<221> modif ied_base 

<222> (1678) . . (1678) 

<223> m2a~-'2 » -o-methyladenosine 



<220> 

<221> modif ied_base 

<222> (1832) . . (1832) 

<223> m2a--2 • -o-methyladenosine 



<220> 

<221> modif ied_base 

<222> (1850) . . (1850) 

<223> m2a--2 • -o-methyladenosine 



<220>
<221> modified_base
<222> (116)..(116)
<223> um

<220>
<221> modified_base
<222> (121)..(121)
<223> um

<220>
<221> modified_base
<222> (172)..(172)
<223> um

<220>
<221> modified_base
<222> (428)..(428)
<223> um

<220>
<221> modified_base
<222> (627)..(627)
<223> um

<220>
<221> modified_base
<222> (799)..(799)
<223> um

<220>
<221> modified_base
<222> (1288)..(1288)
<223> um

<220>
<221> modified_base
<222> (1326)..(1326)
<223> um

<220>
<221> modified_base
<222> (1442)..(1442)
<223> um

<220>
<221> modified_base
<222> (1668)..(1668)
<223> um

<220>
<221> modified_base
<222> (1804)..(1804)
<223> um



<220>
<221> modified_base
<222> (174)..(174)
<223> cm 2'-o-cytidine

<220>
<221> modified_base
<222> (462)..(462)
<223> cm

<220>
<221> modified_base
<222> (517)..(517)
<223> cm

<220>
<221> modified_base
<222> (797)..(797)
<223> cm

<220>
<221> modified_base
<222> (1391)..(1391)
<223> cm

<220>
<221> modified_base
<222> (1703)..(1703)
<223> cm



<220>
<221> modified_base
<222> (436)..(436)
<223> gm

<220>
<221> modified_base
<222> (509)..(509)
<223> gm

<220>
<221> modified_base
<222> (601)..(601)
<223> gm

<220>
<221> modified_base
<222> (644)..(644)
<223> gm

<220>
<221> modified_base
<222> (683)..(683)
<223> gm

<220>
<221> modified_base
<222> (867)..(867)
<223> gm

<220>
<221> modified_base
<222> (1328)..(1328)
<223> gm

<220>
<221> modified_base
<222> (1447)..(1447)
<223> gm

<220>
<221> modified_base
<222> (1490)..(1490)
<223> gm

<220>
<221> modified_base
<222> (1639)..(1639)
<223> gm

<220>
<221> modified_base
<222> (1248)..(1248)
<223> X 3-(3-amino-3-carboxypropyl)-1-methylpseudouridine



<400> 197 

tacctggttg atcctgccag tagcatatgc ttgtctcaaa gattaagcca tgcatgtcta  60
agtacgcacg gccggtacag tgaaactgcg aatggctcat taaatcagtt atggttcctt  120
cggccgctcg ctcctctccc acttggataa ctgtggtaat tctagagcta atacatgccg  180
acgggcgctg acccccttcg cgggggggat gcgtgcattt atcagatcaa aaccaacccg  240
gtcagcccct ctccggcccc ggccgggggg cgggcgccgg cggctttggt gactctagat  300
aacctcgggc cgatcgcacg ccccccgtgg cggcgacgac ccattcgaac gtctgcccta  360
tcaactttcg atggtagtcg ccgtgcctac catggtgacc acgggtgacg gggaatcagg  420
gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag gcagcaggcg  480
cgcaaattac ccactcccga cccggggagg tagtgacgaa aaataacaat acaggactct  540
ttcgaggccc tgtaattgga atgagtccac tttaaatcct ttaacgagga tccattggag  600
ggcaagtctg gtgccagcag ccgcggtaat tccagctcca atagcgtata ttaaagttgc  660
tgcagttaaa aagctcgtag ttggatcttg ggagcgggcg ggcggtccgc cgcgaggcga  720
gccaccgccc gtccccgccc cttgcctctc ggcgccccct cgatgctctt agctgagtgt  780
cccgcggggc ccgaagcgtt tactttgaaa aaattagagt gttcaaagca ggcccgagcc  840
gcctggatac cgcagctagg aataatggaa taggaccgcg gttctatttt gttggttttc  900
ggaactgagg ccatgattaa gagggacggc cgggggcatt cgtattgcgc cgctagaggt  960
gaaattcttg gaccggcgca agacggacca gagcgaaagc atttgccaag aatgttttca  1020
ttaatcaaga acgaaagtcg gaggttcgaa gacgatcaga taccgtcgta gttccgacca  1080
taaacgatgc cgaccggcga tgcggcggcg ttattcccat gacccgccgg gcagcttccg  1140
ggaaaccaaa gtctttgggt tccgggggga gtatggttgc aaagctgaaa cttaaaggaa  1200
ttgacggaag ggcaccacca ggagtggagc ctgcggctta atttgactca acacgggaaa  1260
cctcacccgg cccggacacg gacaggattg acagattgat agctctttct cgattccgtg  1320
ggtggtggtg catggccgtt cttagttggt ggagcgattt gtctggttaa ttccgataac  1380
gaacgagact ctggcatgct aactagttac gcgacccccg agcggtcggc gtcccccaac  1440
ttcttagagg gacaagtggc gttcagccac ccgagattga gcaataacag gtctgtgatg  1500
cccttagatg tccggggctg cacgcgcgct acactgactg gctcagcgtg tgcctaccct  1560
acgccggcag gcgcgggtaa cccgttgaac cccattcgtg atggggatcg gggattgcaa  1620
ttattcccca tgaacgagga attcccagta agtgcgggtc ataagcttgc gttgattaag  1680
tccctgccct ttgtacacac cgcccgtcgc tactaccgat tggatggttt agtgaggccc  1740
tcggatcggc cccgccgggg tcggcccacg gccctggcgg agcgctgaga agacggtcga  1800
acttgactat ctagaggaag taaaagtcgt aacaaggttt ccgtaggtga acctgcggaa  1860
ggatcatta  1869



INTERNATIONAL SEARCH REPORT

International Application No
PCT/US 01/02586

A. CLASSIFICATION OF SUBJECT MATTER
IPC 7 C12N15/11 C12N15/64 C12N15/67 C12N15/79 C12N15/85 C12N5/10 A01K67/027

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
IPC 7 C12N A01H

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)
BIOSIS, EMBASE, CHEM ABS Data

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category* | Citation of document, with indication, where appropriate, of the relevant passages | Relevant to claim No.

No relevant documents disclosed

□ Further documents are listed in the continuation of box C.

□ Patent family members are listed in annex.

* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document but published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family

Date of the actual completion of the international search
28 June 2001

Date of mailing of the international search report
06/07/2001

Name and mailing address of the ISA
European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Authorized officer
Hillenbrand, G

Form PCT/ISA/210 (second sheet) (July 1992)



International Application No. PCT/US 01/02586

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

Continuation of Box I.2
Claims Nos.: 1-114

1). In view of the large number of claims (114 claims) at present on
file and the large number of different oligonucleotide sequences claimed
(194) and also the wording of the claims presently on file, which render
it difficult, if not impossible, to determine the matter for which
protection is sought, the present application fails to comply with the
clarity and conciseness requirements of Article 6 PCT (see also Rule
6.1(a) PCT) to such an extent that a meaningful search is impossible.

2). Moreover, present claims relate to products and methods defined
only by reference to a desirable characteristic or property, namely
translational regulatory activity in a eukaryotic host cell selected from
"(i) translational enhancing activity, (ii) translational inhibitory
activity, (iii) internal ribosome entry site (IRES) activity, and (iv) any
combination thereof". The claims cover all products and methods having
this characteristic or property, whereas the application provides support
within the meaning of Article 6 PCT and/or disclosure within the meaning
of Article 5 PCT for only a very limited number of such products and
methods. In the present case, the claims so lack support, and the
application so lacks disclosure, that a meaningful search over the whole
of the claimed scope is impossible. Independent of the above reasoning,
the claims also lack clarity (Article 6 PCT). An attempt is made to
define the products and methods by reference to a result to be achieved.
Again, this lack of clarity in the present case is such as to render a
meaningful search over the whole of the claimed scope impossible.

3). In view of the reasons given under paragraphs (1) and (2) no search
report has been established for the present application.

4). Furthermore, according to Rule 13 PCT an application must relate to
one invention only or to a group of inventions so linked as to form a
single general inventive concept, i.e. having at least one common
technical feature defining a contribution over the known prior art. In the
present case, the common technical feature among the different groups of
inventions seems to be the fact that the claimed oligonucleotides
comprise "a translational regulatory element comprising at least
one oligonucleotide consisting of about 6-125 ribonucleotides, or a
deoxyribonucleotide sequence encoding said oligonucleotide having
translational regulatory activity in a eukaryotic cell". However, this
single general concept lacks novelty and inventive activity with respect
to the known prior art (see e.g. documents WO99/61613, WO98/37189,
WO94/23C41, WO98/54342, WO00/44895, and WO00/05366). Therefore 4 different
main groups of inventions can be identified (see paragraph 2), which can
be further subdivided into at least 194 different inventions comprising
194 different oligonucleotide sequences and a multiplicity of possible
different combinations thereof.

5). Thus, the ISA fails to see which is the common inventive feature
among the different groups of inventions defining an inventive
contribution over the known prior art.






The applicant's attention is drawn to the fact that claims, or parts of
claims, relating to inventions in respect of which no international
search report has been established need not be the subject of an
international preliminary examination (Rule 66.1(e) PCT). The applicant
is advised that the EPO policy when acting as an International
Preliminary Examining Authority is normally not to carry out a
preliminary examination on matter which has not been searched. This is
the case irrespective of whether or not the claims are amended following
receipt of the search report or during any Chapter II procedure.






